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VASCULAR ENDOTHELIAL GROWTH FACTOR-X 

The present invention is concerned with a novel 
vascular endothelial growth factor (VEGF) herein 
designated S% VEGF-X", and characterisation of the 
nucleic acid and amino acid sequences of VEGF-X. 

Introduction . 

Angiogenesis involves formation and proliferation of 
new blood vessels/- and is an essential physiological 
process for normal growth and development of tissues 
in, for example, embryonic development, tissue 
regeneration and organ and tissue repair. 
Angiogenesis also features in the growth of human 
cancers which require continuous stimulation of blood 
vessel growth. Abnormal angiogenesis is associated 
with other diseases such as rheumatoid arthritis 
psoriasis and diabetic retinopathy. 

Capillary vessels consist of endothelial cells which 
carry the genetic information necessary to proliferate 
to form capillary networks. Angiogenic molecules 
which can initiate this process have previously been 
characterised. A highly selective mitogen for 
vascular enothelial cells is vascular endothelial 
growth factor (VEGF) (Ferrara et al., w Vascular 
Endothelial Growth Factor: Basic Biology and Clinical 
Implications". Regulation of angiogenesis, by I.D. 
Goldberg and E.M. Rosen 1997 Birkhauser Verlag 
Basle/Switzerland) . VEGF is a potent vasoactive 
protein which is comprised of a glycosylated cationic 
46-49 kd dimer having two 24 kd subunits. It is 
inactivated by sulfhydryl reducing agents and is 
resistant to acidic pH and to heating and binds to 
immobilised heparin. 
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VEGF-A has four different forms of 121, 165, 189 and 
206 amino acids respectively due to alternative 
splicing. VEGF121 and VEGF165 are soluble and are 
capable of promoting angiogenesis, whereas VEGF189 and 
5 VEGF206 are bound to heparin containing proteoglycans 
in the cell surface. The temporal and spatial 
expression of VEGF has been correlated with 
physiological proliferation of the blood vessels 
(Gajdusek, C-M., and Carbon, S.J., Cell Physiol., 
10 139:570-579, (1989)); McNeil, P.L., Muthukrishnan, L., 
Warder, E,, D'Amore, P. A., J. Cell. Biol., 109:811- 
822, (1989)). Its high affinity binding sites are 
localized only on endothelial cells in tissue sections 
(Jakeman, L.B., et al., Clin. Invest. 89:244-253 
□ 15 (1989)). The growth factor can be isolated from 

hi pituitary cells and several tumor cell lines, and has 

been implicated in some human gliomas (Plate, K.H. 
M Nature 359:845-848, (1992)). The inhibition of VEGF 

i: function by anti-VEGF monoclonal antibodies was shown 

sj 20 to inhibit tumor growth in immune-deficient mice (Kim, 

K.J., Nature 362:841-844, (1993)). 

H] VEGF proteins have been described in the following 

patents and applications all of which are hereby 

% 25 incorporated by reference EP-0, 506, 477, WO-95/24473, 

WO-98/28621, WO-90/13649, EP-0, 476, 983, EP-0, 550, 296, 
WO-90/13649, WO-96/26736, WO-96/27007, WO-98/49300, 
WO-98/36075, WO-98/840124, WO-90/11084, WO-98/24811, 
WO-98/10071, WO-98/07832, WO-98/02543, WO-97/05250, 
30 WO-91/02058, WO-96/39421, WO-96/39515, WO-98/16551. 

The present inventors have now identified a further 
vascular endothelial growth factor, designated herein 
as "VEGF-X", and the nucleic acid sequence encoding 
35 it, which has potentially significant benefits for the 
treatment of tumours and other conditions mediated by 
inappropriate angiogenic activity. 
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Summary of the Invention 

In the present application, there is provided a novel 
vascular endothelial growth factor, herein designated 
5 U VEGF-X", nucleic acid molecules encoding said growth . 
factor, an expression vector comprising said nucleic 
acid molecule, a host cell transformed with said 
vector and compounds which inhibit or enhance 
angiogenesis. Also provided is the sequence of a CUB 
10 domain present in the sequence of VEGF-X which domain 
itself prevents angiogenesis and which is used to 
treat diseases associated with inappropriate 
vascularisation or angiogenesis. 

15 Detailed Description of the Invention 

Therefore, according to a first aspect of the present 
invention there is provided a nucleic acid molecule 
encoding a VEGF-X protein or a functional equivalent, 

20 fragment, derivative or bioprecursor thereof, said 
protein comprising the amino acid sequence from 
position 23 to 345 of the amino acid sequence 
illustrated in Figure 10. Alternatively, the nucleic 
acid molecule of the invention encodes the complete 

25 sequence identified in Figure 10 and which 

advantageously includes a signal peptide to express 
said protein extracellularly . Preferably, the nucleic 
acid molecule is a DNA and even more preferably a cDNA 
molecule- Preferably, the nucleic acid molecule 

30 comprises the nucleotide sequence from position 257 to 
1291 of the nucleotide sequence illustrated in Figure 
9* In a preferred embodiment the nucleic acid is of 
mammalian origin and even more preferably of human 
origin. 



In accordance with the present invention a functional 



equivalent should be taken to mean a protein, or a 
sequence of amino acids that have similar function to 
the VEGF-X protein of the invention. 

Also provided by this aspect of the present invention 
is a nucleic acid molecule such as an antisense 
molecule capable of hybridising to the nucleic acid 
molecules according to the invention under high 
stringency conditions, which conditions would be well 
known to those skilled in the art* 

Stringency of hybridisation as used herein refers to 
conditions under which polynucleic acids are stable. 
The stability of hybrids is reflected in the melting 
temperature (Tm) of the hybrids. Tm can be 
approximated by the formula: 

81 . 5°C+16. 6 (log 10 [Na + ] +0.41 (%G&C) -600/1 

wherein 1 is the length of the hybrids in nucleotides, 
Tm decreases approximately by 1-1 with every 1% 
decrease in sequence homology. 

The term ^stringency" refers to the hybridisation 
conditions wherein a single-stranded nucleic acid 
joins with a complementary strand when the purine or 
pyrimidine bases therein pair with their corresponding 
base by hydrogen bonding. High stringency conditions 
favour homologous base pairing whereas low stringency 
conditions favour non-homologous base pairing. 

"Low stringency" conditions comprise/ for example, a 
temperature of about 37 °C or less, a formamide 
concentration of less than about 50%, and a moderate 
to low salt (SSC) concentration; or, alternatively, a 
temperature of about 50°C or less, and a moderate to 
high salt (SSPE) concentration, for example 1M NaCl. 



"High stringency" conditions comprise, for example, a 
temperature of about 42°C or less, a formamide 
concentration of less than about 20%, and a low salt 
(SSC) concentration; or, alternatively, a temperature 
of about 65°C, or less, and a low salt (SSPE) 
concentration* Tor example, high stringency 
conditions comprise hybridization in 0,5 M NaHP0 4 , 7% 
sodium dodecyl sulfate (SOS), 1 mM EDTA at 65°C 
(Ausubel, F.M. et al. Current P rotocols in Molecular 
Biology , Vol, I, 1989; Green Inc. New York, at 
2.10.3) . 

W SSC" comprises a hybridization and wash solution. A 
stock 20X SSC solution contains 3M sodium chloride, 
0.3M sodium citrate, pH 7.0. 

U SSPE" comprises a hybridization and wash solution. A 
IX SSPE solution contains 180 mM NaCl, 9mM Na 2 HP0 4 and 
1 mM EDTA, pH 7.4. 

The nucleic acid capable of hybridising to nucleic 
acid molecules according to the invention will 
generally be at least 70%, preferably at least 80 or 
90% and more preferably at least 95% homologous to the 
nucleotide sequences according to the invention . 

The antisense molecule capable of hybridising to the 
nucleic acid according to the invention may be used as 
a probe or as a medicament or may be included in a 
pharmaceutical composition with a pharmaceutically 
acceptable carrier, diluent or excipient therefor. 

The term "homologous" describes the relationship 
between different nucleic acid molecules or amino acid 
sequences wherein said sequences or molecules are 
related by partial identity or similarity at one or 
more blocks or regions within said molecules or 



sequences. 



The present invention also comprises within its scope 
proteins or polypeptides encoded by the nucleic acid 
molecules according to the invention or a functional 
equivalent, derivative or bioprecursor thereof. 

Therefore, according to a further aspect of the 
present invention, there is provided a VEGF-X protein, 
or a functional equivalent, derivative or bioprecursor 
thereof, comprising an amino acid sequence from 
position 23 to 345 of the sequence as illustrated in 
Figure 10, or alternatively which amino acid sequence 
comprises the complete sequence of Figure 10, A 
further aspect of the invention comprises a VEGF-X 
protein, or a functional equivalent, derivative or 
bioprecusor thereof, encoded by a nucleic acid 
molecule according to the invention. Preferably, the 
VEGF-X protein encoded by said nucleic acid molecule 
comprises the sequence from position 23 to 345 of the 
amino acid sequence as illustrated in Figure 10, or 
which sequence alternatively comprises the sequence of 
amino acids of Figure 10. 

The DNA molecules according to the invention may, 
advantageously, be included in a suitable expression 
vector to express VEGF-X encoded therefrom in a 
suitable host. Incorporation of cloned DNA into a 
suitable expression vector for subsequent 
transformation of said cell and subsequent selection 
of the transformed cells is well known to those 
skilled in the art as provided in Sambrook et al. 
(1989), molecular cloning, a laboratory manual, Cold 
Spring Harbour Laboratory Press. 

An expression vector according to the invention 
includes a vector having a nucleic acid according to 



the invention operably linked to regulatory sequences, 
, such as promoter regions, that are capable of 
effecting expression of said DNA fragments. The term 
"operably linked" refers to a juxta position wherein 
the components described are in a relationship 
permitting them to function in their intended manner. 
Such vectors may be transformed into a suitable host 
cell to provide for expression of a polypeptide 
according to the invention. Thus, in a further 
aspect, the invention provides a process for preparing 
polypeptides according to the invention which 
comprises cultivating a host cell, transformed or 
transfected with an expression vector as described 
above under conditions to provide for expression by 
the vector of a coding sequence encoding the 
polypeptides, and recovering the expressed 
polypeptides . 

The vectors may be, for example, plasmid, virus or 
phage vectors provided with an origin of replication, 
and optionally a promoter for the expression of said 
nucleotide and optionally a regulator of the promoter. 

The vectors may contain one or more selectable 
markers, such as, for example, ampicillin resistance. 

Regulatory elements required for expression include 
promoter sequences to bind RNA polymerase and 
transcription initiation sequences for ribosome 
binding. For example, a bacterial expression vector 
may include a promoter such as the lac promoter and 
for translation initiation the Shine-Dalgarno sequence 
and the start codon AUG. Similarly, a eukaryotic 
expression vector may include a heterologous or 
homologous promoter for RNA polymerase II, a 
downstream polyadenylation signal, the start codon 
AUG, and a termination codon for detachment of the 



ribosome. Such vectors may be obtained commercially 
or assembled from the sequences described by methods 
well known in the art. 

Nucleic acid molecules according to the invention may 
be inserted into the vectors described in an antisense 
orientation in order to provide for the production of 
antisense RNA. Antisense RNA or other antisense 
nucleic acids may be produced by synthetic means. 

In accordance with the present invention, a defined 
nucleic acid includes not only the identical nucleic 
acid but also any minor base variations including in 
particular, substitutions in cases which result in a 
synonymous codon (a different codon specifying the 
same amino acid residue) due to the degenerate code in 
conservative amino acid substitutions- The term 
"nucleic acid sequence" also includes the 
complementary sequence to any single stranded sequence 
given regarding base variations. 

The present invention also advantageously provides 
nucleic acid sequences of at least approximately 10 
contiguous nucleotides of a nucleic acid according to 
the invention and preferably from 10 to 50 nucleotides 
even more preferably, the nucleic acid sequence 
comprise the sequences illustrated in Figure 3. These 
sequences may, advantageously be used as probes or 
primers to initiate replication, or the like. Such 
nucleic acid sequences may be produced according to 
techniques well known in the art, such as by 
recombinant or synthetic means. They may also be used 
in diagnostic kits or the like for detecting the 
presence of a nucleic acid according to the invention. 
These tests generally comprise contacting the probe 
with the sample under hybridising conditions and 
detecting for the presence of any duplex or triplex 



formation between the probe and any nucleic acid in 
the sample. 

The nucleic acid sequences according to this aspect of 
the present invention comprise the sequences of 
nucleotides illustrated in Figures 3 and 5. 

According to the present invention these probes may be 
anchored to a solid support. Preferably, they are 
present on an array so that multiple probes can 
simultaneously hybridize to a single biological 
sample. The probes can be spotted onto the array or 
synthesised in situ on the array, (See Lockhart et 
al. f Nature Biotechnology, vol. 14, December 1996 
"Expression monitoring by hybridisation to high 
density oligonucleotide arrays". A single array can 
contain more than 100, 500 or even 1,000 different 
probes in discrete locations. 

The nucleic acid sequences, according to the invention 
may be produced using such recombinant or synthetic 
means, such as for example using PCR cloning 
mechanisms which generally involve making a pair of 
primers, which may be from approximately 10 to 50 
nucleotides to a region of the gene which is desired 
to be cloned, bringing the primers into contact with 
mRNA, cDNA, or genomic DNA from a human cell, 
performing a polymerase chain reaction under 
conditions which brings about amplification of the 
desired region, isolating the amplified region or 
fragment and recovering the amplified DNA. Generally, 
such techniques are well known in the art, such as 
described in Sambrook et ai. (Molecular Cloning: a 
Laboratory Manual, 1989) . 

The nucleic acids or oligonucleotides according to the 
invention may carry a revealing label. Suitable 
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labels include radioisotopes such as 32 P or 35 S, enzyme 
labels or other protein labels such as biotin or 
fluorescent markers. Such labels may be added to the 
nucleic acids or oligonucleotides of the invention and 
may be detected using known techniques per se. 

Advantageously, human allelic variants or 
polymorphisms of the DNA molecule according to the 
invention may be identified by, for example, probing 
cDNA or genomic libraries from a range of individuals, 
for example, from different populations. Furthermore, 
nucleic acids and probes according to the invention 
may be used to sequence genomic DNA from patients 
using techniques well known in the art, such as the 
Sanger Dideoxy chain termination method, which may, 
advantageously, ascertain any predisposition of a 
patient to certain disorders associated with a growth 
factor according to the invention. 

The protein according to the invention includes all 
possible amino acid variants encoded by the nucleic 
acid molecule according to the invention including a 
polypeptide encoded by said molecule and having 
conservative amino acid changes. Conservative amino 
acid substitution refers to a replacement of one or 
more amino acids in a protein as identified in Table 
1. Proteins or polypeptides according to the invention 
further include variants of such sequences, including 
naturally occurring allelic variants which are 
substantially homologous to said proteins or 
polypeptides. In this context, substantial homology 
is regarded as a sequence which has at least 70%, 
preferably 80 or 90% and preferably 95% amino acid 
homology with the proteins or polypeptides encoded by 
the nucleic acid molecules according to the invention. 
The protein according to the invention may be 
recombinant, synthetic or naturally occurring, but is 



preferably recombinant. 



The nucleic acid or protein according to the invention 
may be used as a medicament or in the preparation of a 
medicament for treating cancer or other diseases or 
conditions associated with expression of VEGF-X 
protein. 

Advantageously, the nucleic acid molecule or the 
protein according to the invention may be provided in 
a pharmaceutical composition together with a 
pharmacologically acceptable carrier, diluent or 
excipient therefor. 

The present invention is further directed to 
inhibiting VEGF-X in vivo by the use of antisense 
technology. Antisense technology can be used to 
control gene expression through triple-helix formation 
of antisense DNA or RNA, both of which methods are 
based on binding of a polynucleotide to DNA or RNA. 
For example, the 5' coding portion or the mature DNA 
sequence, which encodes for the protein of the present 
invention, is used to design an antisense RNA 
oligonucleotide of from 10 to 50 base pairs in length. 
A dna oligonucleotide is designed to be complementary 
to a region of the gene involved in transcription 
(triple-helix - see Lee et al. Nucl. Acids Res., 
6:3073 (1979); Cooney et al., Science, 241:456 (1988); 
and Dervan et al., Science, 251: 1360 (1991), thereby 
preventing transcription and the production of VEGF-X. 
The antisense RNA oligonucleotide hybridises to the 
mRNA in vivo and blocks translation of an mRNA 
molecule into the VEGF-X protein (antisense - Okano, 
J- Neurochem. , 56:560 (1991); Oligodeoxynucleotides as 
Antisense Inhibitors of Gene Expression, CRC Press, 
Boca Raton, FL (1988)). 
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Alternatively, the oligonucleotide described above can 
be delivered to cells by procedures in the art such 
that the anti-sense RNA and DNA may be expressed in 
vivo to inhibit production of VEGF-X in the manner 
described above. 

Antisense constructs to VEGF-X, therefore, may inhibit 
the angiogenic activity of VEGF-X and prevent the 
further growth of or even regress solid tumours, since 
angiogenesis and neovascularization are essential 
steps in solid tumour growth. These antisense 
constructs may also be used to treat rheumatoid 
arthritis, psoriasis and diabetic retinopathy which 
are all characterized by abnormal angiogenesis. 

A further aspect of the invention provides a host cell 
or organism, transformed or transfected with an 
expression vector according to the invention. The 
host cell or organism may advantageously be used in a 
method of producing VEGF-X, which comprises recovering 
any expressed VEGF-X from the host or organism 
transformed or transfected with the expression vector. 

According to a further aspect of the invention there 
is also provided a transgenic cell, tissue or organism 
comprising a transgene capable of expressing VEGF-X 
protein according to the invention. The term 
"transgene capable of expression" as used herein means 
a suitable nucleic acid sequence which leads to 
expression of VEGF-X or proteins having the same 
function and/or activity. The transgene, may include, 
for example, genomic nucleic acid isolated from human 
cells or synthetic nucleic acid, including DNA 
integrated into the genome or in an extrachromosomal 
state. Preferably, the transgene comprises the 
nucleic acid sequence encoding the proteins according 
to the invention as described herein, or a functional 
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fragment of said nucleic acid* A functional fragment 
of said nucleic acid should be taken to mean a 
fragment of the gene comprising said nucleic acid 
coding for the proteins according to the invention or 
a functional equivalent, derivative or a non- 
functional derivative such as a dominant negative 
mutant, or bioprecursor of said proteins. For 
example, it would be readily apparent to persons 
skilled in the art that nucleotide substitutions or 
deletions may be used using routine techniques, which 
do not affect the protein sequence encoded by said 
nucleic acid, or which encode a functional protein 
according to the invention. 

VEGF-X protein expressed by said transgenic cell, 
tissue or organism or a functional equivalent or 
bioprecursor of said protein also forms part of the 
present invention. 

Antibodies to the protein or polypeptide of the 
present invention may, advantageously, be prepared by 
techniques which are known in the art. For example, 
polyclonal antibodies may be prepared by inoculating a 
host animal, such as a mouse or rabbit, with the 
polypeptide according to the invention or an epitope 
thereof and recovering immune serum. Monoclonal 
antibodies may be prepared according to known 
techniques such as described by Kohler R. and Milstein 
C, Nature (1975) 256, 495-497. Advantageously, such 
antibodies may be included in a kit for identifying 
VEGF-X in a sample, together with means for contacting 
the antibody with the sample. 

Advantageously, the antibody according to the 
invention may also be used as a medicament or in the 
preparation of a medicament for treating tumours or 
other diseases associated with expression of VEGF-X. 
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The invention also further provides a pharmaceutical 
composition comprising said antibody together with a 
pharmaceutical^ acceptable carrier diluent or 
excipient therefor. 

Proteins which interact with the polypeptide of the 
invention may be identified by investigating protein- 
interactions using the two-hybrid vector system first 
proposed by Chien et al., (1991) Proc. Natl. Acad. 
Sci. USA 88 : 9578-9582. 

This technique is based on functional reconstitution 
in vivo of a transcription factor which activates a 
reporter gene. More particularly the technique 
comprises providing an appropriate host cell with a 
DNA construct comprising a reporter gene under the 
control of a promoter regulated by a transcription 
factor having a DNA binding domain and an activating 
domain/ expressing in the host cell a first hybrid DNA 
sequence encoding a first fusion of a fragment or all 
of a nucleic acid sequence according to the invention 
and either said DNA binding domain or said activating 
domain of the transcription factor, expressing in the 
host at least one second hybrid DNA sequence, such as 
a library or the like, encoding putative binding 
proteins to be investigated together with the DNA 
binding or activating domain of the transcription 
factor which is not incorporated in the first fusion; 
detecting any binding of the proteins to be 
investigated with a protein according to the invention 
by detecting for the presence of any reporter gene 
product in the host cell; optionally isolating second 
hybrid DNA sequences encoding the binding protein. 

An example of such a technique utilises the GAL4 
protein in yeast. GAL4 is a transcriptional activator 
of galactose metabolism in yeast and has a separate 



- 15 - 



domain for binding to activators upstream of the 
galactose metabolising genes as well as a protein 
binding domain. Nucleotide vectors may be 
constructed, one of which comprises the nucleotide 
5 residues encoding the DNA binding domain of GAL4. 

These binding domain residues may be fused to a known 
protein encoding sequence, such as for example, the 
nucleic acids according to the invention. The other 
vector comprises the residues encoding the protein 

10 binding domain of GAL4. These residues are fused to 
residues encoding a test protein. Any interaction 
between polypeptides encoded by the nucleic acid 
according to the invention and the protein to be 
tested leads to transcriptional activation of a 

15 reporter molecule in a GAL-4 transcription deficient 
yeast cell into which the vectors have been 
transformed. Preferably, a reporter molecule such as 
p-galactosidase is activated upon restoration of 
transcription of the yeast galactose metabolism genes. 

20 

A further aspect of the present invention also 
provides a method of identifying VEGF-X in a sample, 
which method comprises contacting said sample with an 
antibody according to the invention and monitoring for 
25 any binding of any proteins to said antibody. A kit 
for identifying the presence of VEGF-X in a sample is 
also provided comprising an antibody according to the 
invention and means for contacting said antibody with 
said sample. 

30 

VEGF-X may be recovered and purified from recombinant 
cell cultures by methods known in the art, including 
ammonium sulfate or ethanol precipitation, acid 
extraction, anion or cation exchange chromatography, 
35 phosphocellulose chromatography, hydrophobic 

interaction chromatography, affinity chromatography, 
hydroxyapatite chromatography and lectin 
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chromatography . 

The VEGF-X protein of the present invention may be a 
naturally purified product, or a product of chemical 
synthetic procedures, or produced by recombinant 
techniques from a prokaryotic or eukaryotic host (for 
example, by bacterial yeast, higher plant, insect and 
mammalian cells in culture) . Depending upon the host 
employed in a recombinant production procedure, the 
polypeptides of the present invention may be 
glycosylated with mammalian or other eukaryotic 
carbohydrates or may be non-glycosylated. 

VEGF-X is particularly advantageous as a wound healing 
agent, where, for example, it is necessary to re- 
vascularize damaged tissues, or where new capillary 
angiogenesis is important. Accordingly, VEGF-X may be 
used for treatment of various types of wounds such as 
for example, dermal ulcers, including pressure sores, 
venous ulcers, and diabetic ulcers. In addition, it 
can be used in the treatment of full-thickness burns 
and injuries where angiogenesis is desired to prepare 
the burn in injured sites for a skin graft and flap. 
In this case, VEGF-X or the nucleic acid encoding it 
may be applied directly to the wound. VEGF-X may be 
used in plastic surgery when reconstruction is 
required following a burn, other trauma, or even for 
cosmetic purposes. 

An important application of VEGF-X is to induce the 
growth of damaged bone, periodontium or ligament 
tissue. For example, it may be used in periodontal 
disease where VEGF-X is applied to the roots of the 
diseased teeth, leading to the formation of new bone 
and cementum with collagen fibre ingrowths. It can be 
used for regenerating supporting tissues of teeth, 
including alveolar bone, cementum and periodontal 



- 17 - 



ligament, that have been damaged by disease and 
trauma. 

Since angiogenesis is important in keeping wounds 
5 clean and non-infected, VEGF-X may be used in 

association with surgery and following the repair of 
cuts. It should be particularly useful in the 
treatment of abdominal wounds where there is a high 
risk of infection. 

10 

VEGF-X can also be used for the promotion of 
endothelialization in vascular graft surgery. In the 
case of vascular grafts using either transplanted or 
synthetic material, VEGF-X may be applied to the 

15 surface of the graft or at the junction to promote the 
growth of the vascular endothelial cells. One 
derivation of this is that VEGF-X can be used to 
repair the damage of myocardial and other occasions 
where coronary bypass surgery is needed by stimulating 

20 the growth of the transplanted tissue. Related to 
this is the use of VEGFX to repair the cardiac 
vascular system after ischemia. 

The protein of the present invention may also be 
25 employed in accordance with the present invention by 
expression of such protein in vivo, which is often 
referred to as "gene therapy". 

Thus, for example, cells such as bone marrow cells may 
30 be engineered with a polynucleotide (DNA or RNA) 

encoding for the protein ex vivo as defined herein, 
the engineered cells are then provided to a patient to 
be treated with the polypeptide. Such methods are 
well-known in the art. For example, cells may be 
35 engineered by procedures known in the art by use of a 
retroviral particle containing RNA encoding for the 
protein of the present invention. 
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Similarly, cells may be engineered in vivo for 
expression of the protein in vivo, for example, by 
procedures known in the art, 

A further aspect of the invention comprises a method 
of treating a disorder mediated by expression of a 
protein according to the invention, by administering 
to a patient an amount of an antisense molecule as 
described herein, in sufficient concentration to 
alleviate or reduce the symptoms of said disorder. 

Compounds which inhibit or enhance angiogenesis may be 
identified by providing a host cell or organism 
according to the invention or a transgenic cell, 
tissue or organism according to the invention, 
contacting a test compound with said cell, tissue or 
organism and monitoring for the effect of said 
compound compared to a cell tissue or organism which 
has not been contacted with said compound. These 
compounds may themselves be used as a medicament or 
included in a pharmaceutical composition for treatment 
of disorders mediated by inappropriate vascularisation 
or angiogenic activity. 

The present inventors have also, advantageously, 
identified in the sequence encoding the VEGF-X protein 
a CUB domain, which has heretofore not previously been 
identified in VEGF-type growth factors. The VEGF-X 
protein may therefore exert dual regulatory effects 
via interaction with the VEGF tyrosine kinase 
receptors or with neuropilin receptors mediated by the 
CUB domain. Thus, the sequence encoding said CUB 
domain may be included in an expression vector for 
subsequent transformation of a host cell, tissue or 
organism. 

VEGF-X or fragments thereof may be able to modulate 
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the effects of pro-angiogenic growth factors such as 
VEGF as indicated in the findings presented in the 
examples below that the N-terminal part of the VEGF-X 
protein, a CUB-like domain, is able to inhibit VEGF- 
stimulated proliferation of HUVECs. VEGF-X or 
fragments thereof may therefore be useful in therapy 
of conditions involving inappropriate angiogenesis. 
Inhibition of the angiogenic activity of VEGF has 
been linked with inhibition of tumour growth in 
several models eg Kim K, J. et al, Nature 362:841- 
844, (1993). Additionally, agents able to inhibit 
angiogenesis would be expected to be useful in 
treating other angiogenesis-dependent diseases such a 
retinopathy, osteoarthritis and psoriasis (Folkman, 
J., Nature Medicine 1:27-31, (1995), 

As identified in more detail in the Examples 
described herein the present inventors have 
surprisingly identified that the CUB domain of VEGF-X 
is able to inhibit stimulation of proliferation of 
HUVECs induced by either VEGF or bFGF. The CUB domain 
may, therefore, be utilised as a therapuetic agent 
for inhibition of angiogenesis and for treatment of 
condition associated with inappropriate 
vascularisation or angiogenesis. 

Therefore according to a further aspect of the 
invention there is provided a method of inhibiting 
angiogenic activity and inappropriate vascularisation 
including formation and proliferation of new blood 
vessels, growth and development of tissues, tissue 
regeneration and organ and tissue repair in a subject 
said method comprising administering to said subject 
an amount of a polypeptide having an amino acid 
sequence from position 40 to 150 of the sequence 
illustrated in Figure 10 or a nucleic acid molecule 
encoding the CUB domain according to the invention in 
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sufficient concentration to reduce or prevent said 
angiogenic activity. 

Furthermore there is also provided a method of 
5 treating or preventing any of cancer, rheumatoid 

arthritis, psoriasis and diabetic retinopathy, said 
method comprising administering to said subject an 
amount of a polypeptide having an amino acid sequence 
from position 40 to 150 of the sequence illustrated 
10 in Figure 10 or a nucleic acid molecule encoding the 
CUB domain according to the invention in sufficient 
concentration to treat or prevent said disorders. 

The CUB domain may also be used to identify compounds 
15 that inhibit or enhance angiogenic activity such as 

inappropriate vascularisation, in a method comprising 
contacting a cell expressing a VEGF receptor and/or a 
neuropilin 1 or 2 type receptor with said compound in 
the presence of a VEGF-X protein according to the 
20 invention and monitoring for the effect of said 

compound or said cell when compared to a cell which 
has not been contacted with said compound. Such 
compounds may then be used as appropriate to prevent 
or inhibit angiogenic activity to treat the disorders 
25 or conditions described herein, or in a 

pharmaceutical composition. An antibody to said CUB 
domain may also be useful in identifying other 
proteins having said sequences. 



\ 
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Deposited Plasmida 

Date of Deposit Accession No. 

Plasmid VEGFX/pCR2.1 
5 1T0P0 FL 1 March 1999 LMBP 3925 



Plasmid VEGFX/pRSETB BD 

amino acids 1 March 1999 LMBP 3926 

10 G230-G345 

Plasmid VEGFX/pcR. 2. 1 

FL Clone 9 20 October 1999 LMBP 3977 

15 Plasmid VEGF-X CUB 

PET22b 20 December 1999 

The above plasmids were deposited at the Belgian 
Coordinated Collections of Microorganisms (BCCM) at 
20 Laboratorium Voor Moleculaire Biologie- 

Plasmidencollectie (LMBP) B-9000, Ghent, Belgium, in 
accordance with the provisions of the Budapest Treaty 
of 28 April 1977. 

25 The invention may be more clearly understood with 
reference to the accompanying example , which is 
purely exemplary, with reference to the accompanying 
drawings, wherein : 

30 Figure 1: is a DNA sequence identified in the 

Incyte LifeSeq™ database coding for a 
novel VEGF-X protein. 



35 



Figure 2: 



is an illustration of amino acid 
sequence of the nucleic acid sequence 
of Figure 1. 



- 22 - 



Figure 3: is an illustration of PCR primer 

sequences utilised to identify the 
VEGF-X protein according to the 
invention. 



10 



Figure 4: is a diagrammatic illustration of the 

spatial relationships in the VEGF-X 
sequence of the clones identified 
using the PCR primer sequences of 
Figure 3. 



15 



Figure 5: is an illustration of the nucleotide 

sequences of the 5' RACE primers used 
to identify the 5' end of the VEGF-X 
open reading frame. 



Figure 6: is an illustration of the sequence 

obtained from the RACE experiment. 



20 



Figure 1 



is an illustration of the nucleotide 
sequences obtained from the search of 
LifeSeq™ database using the sequence 
in Figure 6. 



25 Figure 8: is an illustration of the primers used 

to clone the entire coding sequence of 
VEGF-X. 



30 



Figure 9: is an illustration of the entire 

coding sequence of VEGF-X. 



35 



Figure 10: is an illustration of the predicted 

amino acid sequence of the nucleotide 
sequence of Figure 9. 

Figure 11: is an alignment of the sequence of 
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Figure 10 with the sequences of VEGF-A 
to D. 



Figure 12: 



is an illustration of variant 
sequences of the VEGF-X protein 
according to the invention. 



Figure 13: is an illustration of the 

oligonucleotide primers used for 
E.coli expression of VEGF-X domains 
and for expression of the full length 
sequence of VEGF-X in a 
baculovirus/insect cell expression 
system. 

15 

Figure 14: depicts nucleic acid sequences of 18 

human EST clones obtained from a BLAST 
search of the LifeSeq™ database used 
to identify the full sequence encoding 
20 VEGF-X. 

Figure 15: depicts the nucleotide sequences of 50 

human EST clones obtained from the 
LifeSeq™ database. 



25 



30 



Figure 16: is an illustration of nucleotide 

sequences utilised as primers to 
identify the nucleotide sequence 
encoding VEGF-X. 

Figure 17: is a nucleotide sequence coding for a 

partial VEGF-X protein according to 
the invention. 



35 Figure 18: is an illustration of a partial 

nucleotide sequence encoding VEGF-X 
protein according to the invention. 



- 24 - 



Figure 19: 

5 

Figure 20: 

10 
15 

Figure 21: 

20 

25 Figure 22: 
30 



is an illustration of a DNA and 
polypeptide sequence used for 
mammalian cell expression of VEGF-X. 
The predicted VEGF-X signal sequence 
is in lower case letters- The C- 
terminal V5 epitope and His6 sequences 
are underlined. 

is an illustration of a DNA and 
polypeptide sequence used for 
baculovirus/insect cell expression of 
VEGF-X- In the polypeptide sequence 
the signal sequence is shown in lower 
case. The N-terminal peptide tag 
added to the predicted mature VEGF-X 
sequence is - underlined. 

is an illustration of a DNA and 
polypeptide sequence used for E. coli 
expression of VEGF-X- The polypeptide 
sequences at the N- and C- termini 
derived from the MBP fusion and HisS 
tag respectively are underlined. 

illustrates the disulphide-linked 
dimerisation of vegf-x. Protein 
samples were analysed by SDS-PAGE- 
Prior to loading the gel, samples were 
heated to 95°C for 5 minutes in sample 
buffer in the presence (+) or absence 
(-) of reducing agent. (A) samples 
from COS cell expression of a C- 
terminally V5/His6 peptide-tagged 
construct. The left hand panel is 
total conditioned medium, the right 
hand panel is material purified on 
Nickel agarose resin. Reduced monomer 
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10 
15 

Figure 23: 

20 



25 



Figure 24: 

30 



35 

Figure 25: 



and putative disulphide-linked, non- 
reduced dimer are indicated by arrows. 
There appears to be proteolysis of the 
protein during purification. Gels were 
blotted onto nylon membranes and 
protein detected with an anti V5 
monoclonal antibody. (B) Samples from 
£.co!i expression of a maltose-binding 
protein/His6 dual fusion construct. M 
indicates the molecular weight markers 
(Benchmark, Lif eTechnologies) . The 
gel was stained with Coomassie Blue by 
standard procedures. The fusion 
protein has an apparent molecular 
weight of 80kDa, 

illustrates the glycosylation of VEGF- 
X. vegf-X was purified from the 
culture supernatant of COS cells 
transfected with the pcDNA6/V5-His 
construct. Supernatants were 
harvested 72h post-transfection and 
purified on nickel resin. Samples 
were then treated with EndoH (+) or 
untreated (-) before SDS-PAGE and 
blotting, as described in the legend 
to Figure 22. 

is an illustration of the DNA and 
polypeptide sequence used for E. coli 
expression of the VEGF-like domain of 
VEGF-X. Polypeptide sequences at the 
N-terminus of the protein derived from 
the vector are underlined. 

shows expression of the VEGF-X VEGF 
domain in £. coli. Lane l-10pl broad 
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range marker (New England Biolabs), 
lane 2-lOjil unreduced sample, lane 3- 
10^1 reduced sample. The reduced PDGF 
domain protein (lane 3) has an 
5 apparent molecular weight of 

approximately 19kDa on SDS-PAGE. 



Figure 26: illustrates a DNA and polypeptide 

sequence used for E. coli expression 
10 of the CUB-like domain of VEGF-X. The 

polypeptide sequence at the N-terminus 
derived from the vector-encoded signal 
and the introduced His6 tag are 
underlined, 

15 

Figure 27: shows expression of the VEGF-X CUB 

domain in E. coli. The CUB domain 
protein was purified on Nickel chelate 
resin. The protein migrates at 
20 approximately 23kDa on SDS-PAGE. 



Figure 28: illustrates the effect of truncated 

VEGF-X (CUB domain) on HUVEC 
proliferation. (A) Human Umbilical 

25 Vein Endothelial Cells (one-day- 

treatment) . (B) Human Umbilical Vein 
Endothelial Cells (24-hour starving 
followed by one-day-treatment) . (C) 
Effect of VEGF-A 1W and VEGF-X CUB 

30 domain on the proliferation of HUVEC 

(two-day-treatment) . 



Figure 29: depicts the tissue distribution of 

VEGF-X mRNA analysed by Northern 
35 blotting and RT-PCR in (A) normal 

tissues and (B) tumour tissue and cell 
lines . 
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10 



15 



Figure 30: depicts the partial intron/exon 

structure of the VEGF-X gene. (A) 
Genomic DNA sequences of 2 exons 
determined by sequencing; exon 
sequence is in upper case, intron 
sequence is in lower case. (B) Shows 
the location of splice sites within 
the VEGF-X cDNA sequence. The 
location of mRNA splicing events is 
indicated by vertical lines. The 
cryptic splice donor/acceptor site at 
nt. 998/999 (diagonal lines) gives 
rise to the splice variant forms of 
VEGF-X. No splice site information is 
given for the region shown in italics. 



20 



Figure 31: is a graphic representation of the 

effect of FL-VEGF-X on HuVEC 
proliferation: (24 hour serum 
starvation followed by one day 
treatment) . 



Figure 32: is a graphic representation of the 

combined effect of truncated VEGF-X 
25 (CUB domain) and human recombinant 

VEGF 1€5 on HuVEC proliferation: (24 hour 
serum starvation followed by two day 
treatment) . 



30 Figure 33: is a graphic representation of the 

combined effect of the CUB domain and 
human recombinant bFGF on HuVEC 
proliferation: (24 hour serum 
starvation followed by two day 

35 treatment) . 



Figure 34: 



is a graphic representation of the 



results of a LDH assay for testing 
cytotoxicity of the CUB domain or the 
CUB domain with rhVEGF 165 . 

Figure 35: is a graphic representation of the 

results obtained from a LDH assay for 
testing cytotoxicity of the CUB domain 
or CUB domain with rh-bFGF. 

A BLAST (Basic Local Alignment Search Tool; Altschul 
et al., 1990 J. Mol. Biol. 215, 403-410) search was 
performed in the proprietary LifeSeq™ human EST 
database (Incyte Pharmaceuticals, Inc./ Palo Alto, 
CA, USA) . BLAST produces alignments of both 
nucleotide and amino acid sequences to determine 
sequence similarity. Because of the local nature of 
the alignments, BLAST is especially useful in 
determining exact matches or in identifying 
homologues. While it is useful for matches which do 
not contain gaps, it is inappropriate for performing 
motif-style searching. The. fundamental unit of BLAST 
algorithm output is the High-scoring Segment Pair 
(HSP) . 

Eighteen human EST clones (Figure 14) with high 
similarity to the previously identified VEGF proteins 
were identified and a further fifty EST clones 
(Figure 15) were identified using these sequences as 
query sequences, allowing us to deduce the putative 
sequence for the new VEGF-X protein. The sequences 
obtained were compared to known sequences to 
determine regions of homology and to identify the 
sequence as a novel VEGF- type protein. Using the DNA 
sequence information in the databases we were able to 
prepare suitable primers having the sequences of 
VEGF-X 1-10 illustrated in Figure 3 for use in 
subsequent RACE experiments to obtain the complete 



DNA sequence for the VEGF-X gene. 
Cloning 

A profile was developed based on the VEGF-like domain, 
in existing VEGF sequences (VEGF-A, B, C and D) . 
This was used to search the public databases and the 
Incyte LifeSeq™ database. No significant novel 
matching sequences were found in the public 
databases. All of the matching sequences found in 
the LifeSeq™ database (-1000) were assembled to give 
a smaller number of sequences (-30) , which included 
the known VEGFs and a potential novel VEGF (figures 
1 and 2) . This sequence was named VEGF-X. 

Oligonucleotides were designed to amplify the VEGF-X 
sequence from cDNA (figure 3) . The ESTs found in 
LifeSeq™ were from a range of tissues, with a slight 
predominance of sequences from ovary, testis, 
placenta and lung (Figure 14 and 15) . Accordingly 
the oligonucleotides were used to amplify cDNA 
derived from lung and placenta. First-round PCR 
products were found at -200bp larger than the 
expected sizes, while 3 major species appeared after 
a second round of PCR amplification, the smallest of 
which was of the expected size. These fragments were 
cloned and sequenced. The smallest fragment did 
indeed have the sequence originally identified from 
the LifeSeq database, while the others contained 
insertions (figure 4) , 

As the first round of amplification suggested that 
the major species found in cDNA from ovary and 
placenta was not that originally identified in the 
LifeSeq™ database, the focus of effort was switched 
to the presumed major species (it seemed likely that 



clones 57, 25-27 and 2.1kb clones 1-3 in fig 4 
represented the major mRNA species) . Conceptual 
translation of the DNA sequences of these cloned PCR 
fragments indicated that the complete open reading 
frame was not present in the clones or in the 
sequence from LifeSeq™. While all clones contained 
the same sequence in the region of the translation 
termination codon, indicating that the end of the 
open reading frame had been identified, the 5' end of 
the open reading frame had not been cloned. 5' RACE 
experiments were therefore carried out in order to 
find the start of the reading frame, PCR primers 
designed for RACE experiments are shown in figure 5. 
RACE PCR products were sequenced directly. Sequence 
could be obtained from the 3' end of these RACE 
products but not from the 5' end; probably because 
the products were not cloned and were therefore 
heterogeneous at the 5' end. This new sequence was 
assembled with the existing cloned sequence to give 
the sequence shown in figure 6. Searching the 
LifeSeq™ database with this sequence identifies ESTs 
which extend the sequence a further 140bp in the 5' 
direction and a further 160bp in the 3' direction 
(figure 7) , This longer contig was used to design 
oligonucleotide primers to amplify the entire coding 
sequence (these primer sequences are shown in figure 
8). PCR was carried out using primers 5'-l and 
vegfXIO (in order to clone a "full-length" cDNA) , and 
with primers 5'-l and vegfX6 (in order to clone the 
full coding region, see figure 3 for sequences of 
vegfXIO and vegfX6) . A number of clones were 
obtained for the shorter fragment, of which clones 4 
and 7 contain no PCR errors (sequence of clones 4 & 7 
in figure 9) . A single clone was obtained for the 
longer fragment (clone 9), but this sequence appears 
to contain 2 PCR errors. 
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The predicted polypeptide from these longer contigs 
is shown in figure 10. Amino acids 1-22 are 
predicted to encode a signal sequence (von Heijne, 
1986, Nucleic Acids Res. 14, 4683-4690). Figure 11 
shows an alignment of the protein sequence with VEGFs 
A-D. The region homologous to the other VEGFs is 
located towards the Oterminus of the protein. As 
the VEGF homology domain is expected to belong to the 
TGF-beta superfamily of growth factors and to consist 
of a dimer containing both intra- and intermolecular 
disulphide bonds, initial alignments focussed on the 
cysteines. However/ mapping of the sequence onto the 
known x-ray structure of the VEGF-A receptor-binding 
domain (Muller et al (1997) Proc. Natl. Acad. Sci USA 
94, 7192-7197) suggests that the alignment in figure 
11 is plausible, as the extra 4 cysteine residues 
within the VEGF-homology region of VEGF-X (compared 
to this region of VEGF-A) correspond to residues 
which are spatially close in VEGF-A, and may 
therefore be able to form disulphide bonds. 

A search of the PFAM database of protein domains with 
the full-length polypeptide sequence from figure 10 
identifies two domain consensus sequences within the 
polypeptide. The more C-terminal domain is a W VEGF" 
domain: (the known VEGFs all contain this domain and 
the structure of this region of VEGF-A is similar to 
that of PDGF) - Additionally towards the N-terminus 
of the polypeptide there is a CUB domain (amino acids 
-40-150) . The CUB domain is a 100-110 amino acid 
extracellular domain found in a number of 
developmentally-regulated proteins. When the full- 
length protein is used to search the protein 
databases using the BLAST 2 algorithm, the scores for 
matches to CUB domain-containing proteins are more 
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significant than those to the other VEGFs . 
Interestingly, the most significant matches are to 
the CUB domains of Neuropilins, and Neuropilin-1 was 
recently identified as a receptor of one of the VEGF- 
A isoforms VEGF-A 165 (Soker et al. (1998) Ceil 92, 735- 
745) . 

Assuming that the variant sequences isolated by PCR 
(i.e. the smaller PCR fragments) use the same 
translation initiation site as the full-length 
sequence, they would result in production of the 
variant proteins shown in figure 12. It may be 
significant that both of these variant proteins 
retain the CUB domain and delete all or part of the 
VEGF-like domain. The production of these variant 
sequences can be explained by the use of a cryptic 
splice donor/acceptor site within the VEGF-X sequence 
(figure 30B, between nt. 993/999) : one variant arises 
by splicing out of the region between nt. 729-998, 
the other by splicing out of the region between nt. 
999-1187. 

Expression 

Full-length expression constructs 
Mammalian cells 

Clone 4 containing the full CDS of VEGF-X (see figure 
9) , was used to generate constructs for expression of 
full-length protein. The sequence was amplified by 
PCR and cloned into the vector pCDNA6/V5-His so as to 
add a C-terminal V5 epitope tag and His 6 tag. The DNA 
and polypeptide sequence in this vector is shown in 
figure 19. Transient expression in COS cells 
followed by western blotting and detection via an 
anti-V5 mAb demonstrates the secretion of a protein 
of ~50K into the medium in transfected cells only 
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(figure 22A) . This construct can also be used to 
generate VEGF-X expressing stable CHO cell lines. 

Baculovirus/Insect-cell expression system 
For expression in the baculovirus/insect cell system 
the DNA encoding the predicted mature VEGF-X 
polypeptide sequence was fused to a sequence encoding 
a signal derived from melittin, a secreted insect 
protein. An N-terminal 6His tag was also added to 
facilitate purification. The insert was then cloned 
into the baculovirus expression vector pFASTBAC. The 
DNA and polypeptide sequence of this construct is 
shown in figure 20. Infection of Trichoplusia ni Hi5 
cells with this recombinant baculovirus results in 
the secretion of a protein of approximately 45K into 
the medium (data not shown) . 

E. coli 

The coding region of VEGF-X has been cloned in a 
variety of ways for expression as a secreted protein 
in E.coli. A particularly useful expression clone 
carries an N-terminal fusion to the E.coli 
maltose-binding protein (MBP- derived from the 
expression vector pMAL-p2, New England Biolabs) and a 
C-terminal fusion to a 6His tag. The DNA and 
polypeptide sequence of this vector is shown in 
figure 21. Sequential purification of cell fractions 
on Ni-NTA resin and amylose resin allows the 
isolation of the expressed protein (see figure 22B) - 

Expression of fragments 

VEGF 

The VEGF domain of VEGF-X has been expressed in 
E.coli. Similar domains from VEGF-A (Christinger et 
al. (1996) PROTEINS: Structure, Function and Genetics 
26, 353-357), and VEGF-D (Achen et al (1998) Proc. 



Natl. Acad. Sci USA 95, 548-553) have been shown to 
be capable of binding to the respective receptors. 
Expression of these domains was carried out using the 
bacterium E.coli. Additionally, the full-length 
protein. was expressed using the baculovirus/insect 
cell expression system. The oligonucleotide primers 
which have been obtained for these experiments are 
shown in figure 13. The construct directed 
expression in the bacterial cytoplasm, and as 
expected the protein was produced in insoluble form 
in inclusion bodies (the DNA and polypeptide sequence 
used for PDGF domain expression is shown in figure 
24) . Inclusion bodies were washed, solubilized with 
urea and the protein purified under denaturing 
conditions, before refolding by dialysis to remove 
the urea. Soluble protein was obtained, but shows 
little evidence of the disulphide bond linked dimers 
seen with material derived from animal cells (figure 
25, compare with figure 22A & B) . It is not clear 
therefore whether this protein is correctly folded. 

CUB 

The CUB domain has been expressed as a soluble 
secreted protein in E.coli (figure 26). The protein 
was purified by binding to Ni-NTA resin (figure 27) 
and assayed for activity on KUVECs in an in-vitro 
proliferation assay. 

Properties of the VEGF-X protein 

The transient mammalian cell expression system 
described above has been used to generate full-length 
VEGF-X protein, as shown by antibody detection 
following Western blotting (see figure 22A) . 

Disulphide bond linked dimers 

The other members of the PDGF family of growth 
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factors, the PDGFs and VEGFs, all exist as dimers in 
which two monoiners constituting the dimer are linked 
by interchain disulphide bonds. The x-ray structures 
of PDGF-BB (Oefner et al, 1992), and VEGF-A (Muller 
5 et al, 1997) are known and indicate that at least 
these two members of the family contain two 
interchain disulphide bonds. Practically this means 
that in SDS-.PAGE analysis of these growth factors the 
presence of interchain disulphide bonds is shown by a 

10 large decrease in mobility in the absence of reducing 
agent (ie. the nonreduced dimer migrates more slowly 
through the gel than the reduced monomer) . This 
effect was also expected for VEGF-X, and has been 
demonstrated for the material obtained from transient 

15 mammalian cell expression (figure 22A) . In the case 
of the full length material produced in E.coli only 
some 10% of the total VEGF-X protein appears to be 
present as disulphide bond-linked dimers (figure 
22B) . However, these results provide evidence that 

20 the mammalian cell-derived protein is correctly 
folded, and that a portion of the E. coli-derived 
protein is too. 

Glvcosylation 

25 There are 3 predicted potential N-linked 

glvcosylation sites within the VEGF-X protein: at 
residues 25, 55 and 254 of the polypeptide sequence. 
The predicted molecular mass of the mature VEGF-X 
protein is 40kDa, but SDS-PAGE and western blotting 

30 (detection via an introduced C-terminal epitope tag- 
see figure 19) of the full-length protein expressed 
in COS cells gives a band slightly larger than the 
expected size (45-50kDa) as well as one at 25kDa 
(figure 22A) . This smaller band is presumed to be a 

35 C-terminal proteolysis fragment derived from the 

full-length molecule (controls from uninfected cells 
do not show this band) , probably corresponding to a 



cleavage between the CUB and VEGF domains. EndoH 
treatment of the preparation gives a slight mobility 
change for the full-length protein (figure 23), but 
for the smaller VEGF domain fragment there is a clear 
change, indicating that the predicted glycosylation _ 
site within the VEGF domain at residue 254 is indeed 
glycosylated. 

Activity of proteins in cell-based assays 

Protein samples were tested for activity in cell 
proliferation/ cell migration and in^vitro 
angiogenesis assays. Active samples can also be 
tested in the in vivo matrigel mouse model of 
angiogenesis . 

Full-length VEGF-X protein 

Conditioned medium derived from COS cells transiently 
expressing VEGF-X (see figure 22A) displayed no 
detectable activity in any of the assays. However, 
as VEGF-X protein could only be detected in this 
preparation by Western blotting, and not by 
Coomassie-staining of gels, it is clearly present at 
very low levels and this may be the reason for the 
observed lack of activity in the cell proliferation, 
migration or in vitro angiogenesis tests. 

VEGF domain 

The VEGF domain protein described above has been 
tested in cell proliferation (on a range of cell 
types), cell migration and in vitro angiogenesis 
assays and has failed to show activity in any of 
these tests. As suggested above, this may be due to 
incorrect folding of this protein. 



CUB domain 

The CUB domain protein at the highest dose tested 



(lpg/ml) appears to inhibit proliferation of HUVBCs 
in the absence of other stimulation (figure 28A & B) . 
This effect is also seen following stimulation with 
the lowest VEGF-A 16S dose tested (Ing/ml- figure 28C) . 
The CUB domain of VEGF-X therefore appears to show . 
antiproliferative activity on HUVECs, even in the 
presence of low VEGF-A 165 doses. 

Tissue distribution of mRNA 
VEGF-A mRNA expression has been shown to be 
upregulated in a wide variety of human tumors (lung, 
breast, ovarian, colon, stomach, liver, pancreas, 
kidney, bladder and prostate- Takahashi et al, 1995) . 
Tumor VEGF-A expression has been shown to correlate 
with tumor growth rate, microvascular density and 
tumor metastasis (Takahashi et al, 1995) ■ It was 
thus of interest to examine the mRNA expression 
patterns of VEGF-X. Accordingly, Northern blot 
analysis of mRNA derived from different tissues has 
been carried out. The results indicate that although 
the VEGF-X mRNA is expressed at low levels, it is 
present in a wide range of tissues. PGR 
amplification of cDNA from a range of tissue sources 
supports this idea (figure 29A) . The major mRNA 
species is approximately 3.1kb in size. There is 
no significant upregulation seen in tumour cell lines 
or in tumour tissues tested (figure 29B), with the 
possible exception of the cell lines GI-117 (lung 
carcinoma) and SaOS-2 (osteosarcoma) . The results of 
these initial tissue distribution studies do not, 
therefore, provide evidence for upregulation of 
VEGF-X in tumour growth, as is seen with VEGF-A. 

Genomic structure ©£ the VEGF-X gene 

A genomic BAG clone covering the 3' part of the 

VEGF-X locus was isolated by hybridisation screening 
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of nylon filters containing a human BAC library. 
Direct sequencing of this clone using oligonucleotide 
primers based on the VEGF-X cDNA sequence allowed the 
determination of several intron/exon boundaries 
5 {figure 30) . Interestingly, the position of the mRNA 
splice site within the PDGF domain <nt 1187/1188 in 
figure 30B) is conserved with respect to those in the 
VEGF-A and VEGF-D genes (Tischer et al, 1991; 
Rocchigiani et al, 1998). 

10 

Materials & Methods 

PCR, Cloning, DNA sequence determination and BAC 
screening . 

15 All primers were purchased from Eurogentec, Seraing, 
Belgium. Insert-specific sequencing primers (15- and 16- 
mers) were designed by visual inspection of the DNA 
sequences. DNA was prepared on Qiagen-tip-20 columns or 
on Qiaquick spin columns (Qiagen GmbH, Dtisseldorf, 

20 Germany) and recovered from the spin columns in 30pl 
Tris/EDTA-buffer (lOmM TrisHCl pH 7,5, 1 mM EDTA (sodium 
salt) ) . Sequencing reactions were performed using 
BigDye™ Terminator Cycle Sequencing Ready Reaction kits 
(Perkin Elmer, ABI Division, Foster City, CA, USA) and 

25 were run on an Applied Biosystems 377 DNA sequencer 
(Perkin Elmer, ABI Division, Foster City, CA, USA) . 
Polymerase chain reactions were carried out according 
to standard procedures (Ausubel et al, 1997) . The 
PCR fragments were cloned into vectors pCR2 . 1 

30 (Invitrogen, Carlsbad, CA. USA) or pCR-TOPO 

(Invitrogen,NL) according to the manufacturer's 
instructions. One of those vectors, plasmid 
VEGFX/pCR2.1 1TOP0 FL 

was deposited on 1 March 1999 under Accession No. 
35 LMBP 3925, After sequence determination, the inserts 
were cloned into the desired expression vectors (see 
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figures 19, 20, 21, 24 & 26). 

A human genomic BAC library (Genome Systems, Inc., St 
Louis, Ml, USA) was screened by hybridisation to 
oligonucleotides derived from the VEGF-X cDNA 
sequence, according to the manufacturer's 
instructions. BAG DNA was prepared using a Qiagen 
plasmid midi kit (Qiagen GmbH, Dtisseldorf , Germany ) 
according to the manufacturer's instructions with 
some modifications (after clearing of the lysate from 
chromosomal DNA, supernatants from individual 
preparations were pooled on a single column {tip 
100), and after the 70 % EtOH wash, the pellet was 
resuspended overnight at 4°C in 100 yl TE) . 20-mer 
sequencing primers were designed based on the known 
cDNA sequence, and sequencing carried out as above • 

5' RACE 

In order to extend the cDNA clone in a 5' direction 
RACE reactions were carried out. Since it was known 
that the mRNA is present in placenta and skeletal 
muscle, Marathon-Ready™ placenta and skeletal muscle 
cDNAs were purchased from Clontech (Palo Alto CA. 
USA) and used according to the manufacturer's 
instructions- DNA fragments were excised from 
agarose gels, purified using QiaQuick PCR 
purification columns (Qiagen GmbH, Dtisseldorf, 
Germany) and sequenced directly, 

VEGF-X protein expression and purification 
DNA fragments encoding the desired protein sequences 
were amplified by PCR and cloned into appropriate 
expression vector systems. 

For mammalian cell expression, the full coding 
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sequence was cloned into the vector pcDNA6/V5-his 
(Invitrogen Leek, NL, see figure 19 for construct 
sequence), so as to add a C-terminal peptide tag to 
assist in detection and purification. 

5 

For insect cell expression the sequence of the 
predicted mature polypeptide was initially amplified 
to add an N-terminal 6His peptide and then cloned 
into the pMelBacB vector (Invitrogen, Leek, NL) to 
10 add an insect cell signal sequence . The entire 

insert was then PCR-cloned into the vector pFASTBAC-1 
(LifeTechnologies, Gaithersburg, MA, USA) for 
construction of a baculovirus according to the 
manufacturer's instructions. 

15 

For E.coli expression, the coding region was PCR 
amplified to add a C-terminal 6His tag and then 
cloned into the vector pMAL-p2 (New England Biolabs, 
Beverly, MA, USA) . The coding sequence of this 
20 construct is shown in figure 21), The protein was 
purified first on Ni-NTA resin (Qiagen GmbH, 
DUsseldorf, Germany) and then on amylose resin (New 
England Biolabs, Beverly, MA, USA), according to the 
manufacturers' instructions . 

25 

DNA sequences encoding the CUB and VEGF domain 
fragments of VEGF-X were PCR amplified and cloned 
into pET22b and pET21a (Novagen, Madison, WI, USA) 
respectively. The CUB domain protein was prepared 

30 either from the periplasm or medium of induced 

cultures by standard methods (Ausubel et al, 1997). 
The protein was initially purified by precipitation 
with 20% ammonium sulphate. After overnight dialysis 
vs 20mM Tris Hcl pH7.5, lOOmM NaCl to remove ammonium 

35 sulphate, the protein was further purified on Ni-NTA 
resin as described above. The VEGF domain protein 
was expressed in insoluble form, and preparation of 
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inclusion bodies was carried out using standard 
procedures (Ausubel et al 1997). Inclusion bodies 
were dissolved in SM guanidine hydrochloride, 20mM 
Tris Hcl pH8.0, 200mM NaCl, ImM 2-mercaptoethanol, 
and purified on Ni-NTA resin (Qiagen GmbH, 
Diisseldorf, Germany) according to the manufacturer's 
instructions. The protein was refolded by dialysis 
against several .changes of buffer containing 
decreasing concentrations of denaturant. 

Analysis of protein glycosylation was carried out 
using EndoH (Roche Molecular Biochemicals, Brussels, 
BE) according to the manufacturer's instructions. 

Cell Proliferation Assay 

Human umbilical vein endothelial cells (HUVECs) 
(Clonetics, San Diego, CA. ) were trypsinized with 
0.05% trypsin/0. 53mM EDTA (Gibco, Gaithersburg, MD. ) , 
resuspended in the EGM-2 (Clonetics, San Diego, CA.), 
counted, and distributed in a 96-well tissue culture 
plate at 5,000 cells/well. Following cell attachment 
and monolayer formation (16 hours), cells were 
stimulated with various concentrations of truncated 
VEGF-X (CUB domain or VEGF domain) or dilutions of 
culture supernatants of the full-length VEGF-X (COS 7 
or HEK293) in DMEM (Gibco, Gaithersburg, MD. ) 
containing 0.5% to 2% FBS (HyClone, Logan, UT) as 
indicated. For human fetal dermal fibroblasts 
(American Type Culture Collection, Rockville, MD.), 
the growth medium was replaced by DMEM containing 
0.1% BSA (Sigma, St. Louise, MO.) with or without 
various concentrations of truncated VEGF-X proteins. 
For HCASMC (Clonetics, San Diego, CA.), the medium 
was replaced by DMEM containing 0.5% FBS. The cells 
were treated for a further 24 hr-72 hr. For the 
measurement of proliferation, the culture media were 
replaced with 100 pi of DMEM containing 5% FBS and 3 



pCi/ml of [3H] -thymidine (Amersham, Arlington 
Heights, IL.). Following pulse labeling, cells were 
fixed with methanol/acetic acid (3:1, vol/vol) for 1 
hour at room temperature. The cells were washed 
twice with 250 pl/well of 80% methanol. The cells 
were solubilized in 0.05% trypsin (lOOpl/well) for 
30 minutes then in 0.5% SDS (100 pl/well) for another 
30 minutes- Aliquots of cell lysates (180 pi) were 
combined with 2 ml of scintillation cocktail (Fisher, 
Springfiled, NJ) and the radioactivity of cell 
lysates was measured using a liquid scintillation 
counter (Wallac 1409) . In each case, samples were 
performed in quadruplicate. 

Chemotaxis Assay 

The chemotactic response of HUVECs was assayed using 
a 48-well modified Boyden chamber (NeuroProbe, Cabin 
John, MD-) and collagen-coated (O.lmg/ml type I 
collagen, Collaboratic Biomedical, Bedford, MA,) 
polycarbonate membrane filters with a pore diameter 
of 8 pm (NeuroProbe, Cabin John, MD-). Cell 
suspensions (15, 000/well) were loaded to the upper 
part of the chemotaxis chamber and stimulated for 4 
hours with rhVEGF 165 (0.1-Z0 ng/ml) (Calbiochem, San 
Diego, CA. ) or various concentrations of truncated 
VEGF-X (PDGF domain) . Cells remaining on the top of 
the membrane were removed. Migration was assessed by 
counting the number of cells that migrated to the 
lower side of the filter membrane. The membrane was 
fixed with 10% formaldehyde for 15 min, followed by 
staining with Gill's hemotoxylin III (Poly 
Scientific, Bay Shore, NY.). The assay was performed 
in triplicates and six independent high power fields 
per well were counted using a light microscope at 250 
magnification. The results were expressed as the fold 
of unstimulated cells (EGM containing 0,1% BSA) . 
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In Vitxo Angiogenesis Assay 

In vitro angiogenesis in fibrin gels was quantitated 
using spheroids of human umbilical vein endothelial 
cells (Korff et al., 1998). To generate endothelial 
5 cell spheroids of defined si2e and cell number, a 
specific number of cells (- 800 cells per spheroid) 
was suspended in EGM-2 culture medium containing 20% 
methylcellulpse .(Sigma, St. Louis, MO.), seeded into 
nonadherent round-bottom 96-well plates. All 

10 suspended cells in one well contributed to the 
formation of a single endothelial cell spheroid 
within 24 hours. A fibrin gel stock solution was 
prepared freshly prior to use by mixing 3mg/ml 
fibrinogen (Calbiochem, San Diego, CA.) in Medium 

15 199(Gibco, Gaithersburg, MD.)- Assays were performed 
in 24-well culture plates. The 1ml fibrinogen stock 
was mixed with 50 HUVEC spheroids and the 
corresponding test substance including rh-VEGF 16S or 
various concentration of VEGF-X. The 

20 spheriod-containing fibrinogen was rapidly 

transferred into 24-well plates. Fifteen microliters 
of thrombin (100 NIH U/ml stock, Sigma, St. Louis, 
MO.) was added to the gel for the fibrin gel 
formation. The gel formation usually occurred within 

25 30 seconds- After gel formation, lml/well of Medium 
199 supplemented with 20% FBS, lmg/ml s-aminocaproic 
acid (Calbiochem, San Diego, CA.) and antibiotics 
were added. The gel was incubated at 37 °C (5%C0 2 , 95% 
air, 100% humidity). After 3 days, in vitro 

30 angiogenesis was quantitated by measuring the length 
of the three longest capillary sprouts that had grown 
out of each spheroid (100X magnification), analyzing 
at least 10 spheroids per experimental group and 
experiment . 

35 

Matrigel Mouse Assay 
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The matrigel mouse assay is carried out as described 
by Passanti et al (1992). 

Analysis of VEGF-X gene expression by RT-PCR 
5 analysis. 

Oligonucleotide primers VEGF-E2 and VEGF-X14 (figure 
16; figure 5) were used for the specific PCR 
amplification of a 350 bp fragment from VEGF-X. PCR 
amplifications were performed on human multiple 

10 tissue cDNA (MTC™) panels (Clontech human MTC panels 
I and II and human Tumor MTC panel) normalised to the 
mRNA expression levels of six different housekeeping 
genes. In addition, cDNA was made from different 
tumor cell cultures (Caco-2 colorectal 

15 adenocarcinoma; T-84 colorectal carcinoma; MCF-7 
breast adenocarcinoma; T-47D breast ductal gland 
carcinoma; KT1080 bone fibrosarcoma; SaOS-2 
osteosarcoma; SK-N-MC neuroblastoma; HepG2 
hepatoblastoma; JURKAT T-cell leukemia and THP-1 

20 myelomonocytic leukemia) . For the preparation of 

tumor cell cDNA, cells were homogenised and total RNA 
prepared using the RNeasy Mini kit (Qiagen GmbH, 
Hilden, Germany) according to manufacturer's 
instructions. 1 pg of total RNA was reverse 

25 transcribed using oligo(dT)15 as a primer and 50 U of 
Expand™ Reverse Transcriptase (Boehringer Mannheim, 
Mannheim, Germany) according to the manufacturer's 
instructions. PCR reactions with VEGF-X-specif ic os 
glyceraldehyde-3-phosphate dehydrogenase 

30 (G3PDH) -specific primers were then performed on 1 ul 
of this cDNA. For all cDNAs, PCR reactions with 
VEGF-X specific primers were performed in a total 
volume of 50 containing 5 pi (±1 ng) of cDNA, Ix 

Advantage KlenTaq PCR reaction buffer, 0.2 mM dNTP, 

35 250 nM of primers VEGF-E2 and VEGF-X14 and 1 \xl of 

Advantage KlenTaq polymerase mix. Samples were heated 
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to 95°C for 30 s and cycling was done for 30 s at 
9S°C and 30 s at 68°C for 25/ 30 or 35 cycles. 
Control reactions using specific primers that amplify 
a 1 kb fragment of the housekeeping gene G3PDH were 
5 also performed according to the manufacturer's 
instructions , 

Northern blot analysis of VEGF-X. 

Northern blots containing 2 \ig of poly (A) -rich RNA 
10 derived from different human tissues (Clontech 

Laboratories; MTN™ blot, MTN™ blot II and Cancer Cell 
Line MTN™ blot) were hybridized according to the 
manufacturers instructions with a ot- [ 32 P] -dCTP 
r „ random-priming labelled (Multiprime labelling kit, 

€1 15 Roche Diagnostics) 293 bp specific VEGF-X fragment 

J* (PinAI-StuI fragment including 92 bp of the 3' end 

2j coding region and 201 bp of the 3' untranslated 

Qi region of VEGF-X) . The blots were hybridized 

J- overnight at 68 °C and final washes at high stringency 

B * 20 were at 68 °C in O.lx SSC/0.1 % SDS. The membranes 

M were autoradiographed for 1 to 3 days with 

^ intensifying screens. 

dj Full length VEGF-X 

- 25 The effect of full length VEGF-X on proliferation of 

HuVEC cells was determined by the 3 H-Thymidine 
incorporation assay. HuVEC cells were serum starved 
for 24 hours prior to treatment with the full length 
VEGF-X at the concentration range from 100 pg/ml-10 

30 vig/ml. There was no effect of VEGF-X at 100 pg/ml-10 
ng/ml on endothelial cell proliferation. At the 
higher concentrations of FL-VEGF-X (100 ng/ml and 1 
pg/ml) there was a marked inhibition of endothelial 
cell proliferation. This is probably due to the very 

35 high endotoxin level in the samples. The VEGF-X 
sample was purified in order to decrease the 
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endotoxin level and is currently tested in the cell 
proliferation assay. 

The Summary from Testing the CUB Domain 
5 The effect of CUB domain on inhibition of HuVEC 

prolieration either serum- (2%), rh-VEGF or bFGF- 
stimulated, was assessed by the 3 H-Thymidine 
incorporation assay. Cells were serum starved 
followed by the treatment with the CUB domain and 

10 various growth factors. Results showed that the CUB 
domain inhibited endothelial cell proliferation, 
either serum- (2%), rh-VEGF or bFGF-stimulated in a 
dose dependent manner with maximal inhibition at 10 
pg/ml. There was approximately a 2-fold inhibition 

15 of proliferation (at 10 jig/ml) of cells stimulated 

with VEGF and bFGF and nearly a 5-fold inhibition of 
cells stimulated with serum (2%) . Results with the 
LDH assay showed that there was no cytotoxicity 
associated with the inhibition of cell proliferation 

20 by the CUB domain. 

Therefore, the N-terminus of the polypeptide from 
Figure 10 has been shown to possess a CUB domain. 
When database searches are carried out using the 

25 full-length coding sequence the best matches (i.e. 

for a BLAST search, those with the lowest probability 
score) are found with the CUB domain rather than with 
the VEGF-like domain- The best match from searching 
release 37 of the SWISSPROT database (Feb 1999) is to 

30 the CUB domain of a neuropilin from Xenopus laevis, 
and the matches to the CUB domains of human 
neuropilins 1 and 2 are also more significant than 
matches to the VEGFs. 

35 This similarity is provocative, given the 

identification of neuropilin-1 and -2 as cellular 
receptors for the VEGF-A 165 (Stoker et al. 1998, 
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reviewed in Neufeld et al. 1999). It is plausible 
therefore that VEGF-X could exert dual regulatory 
effects: via interaction with the tyrosine kinase 
VEGF-receptors mediated by the VEGF-like domain, as 
5 well as via interaction with VEGF isoforms or with 
the neurophilin receptors, mediated by the CUB 
domain. 

To the best of our understanding the latter would be 
10 entirely novel, and searches on the most recent 
release of the Incyte database do not reveal any 
other proteins containing both CUB and VEGF-like 
domains. This arrangement of domains suggests 
possible positive or negative models of regulation; 

15 

Positive- the VEGF-like domain is able to interact 
productively with the tyrosine kinase VEGF receptors 
giving activation, and the CUB domain is able to 
interact productively with the neuropilin receptor 
20 giving activation. 

Negative- the VEGF-like domain does not interact 
productively with the tyrosine kinase VEGF receptors, 
either preventing receptor dimerisation or blocking 

25 the VEGF binding sites. Further, the CUB domain does 
not interact productively with the neuropilin 
receptors, either preventing receptor activation or 
blocking the VEGF binding sites, or indeed by binding 
to VEGF isoforms and preventing their interaction 

30 with receptors. 



TABLE 1 



ORIGINAL RESIDUE 


EXEMPLARY 


SUBSTITUTIONS 


ALA 


SER, 


THR 




ARG 


LYS 




- 


ASN 


HIS, 


SER 




ASP 


GLU, 


ASN 




CYS 


SER 






GLN 


ASN, 


HIS 




GLU 


ASP, 


GLU 




GLY 


ALA, 


SER 




HIS 


ASN, 


GLN 




ILE 


LEU, 


VAL, 


THR 


LEU 


ILE, 


VAL 




LYS 


ARG, 


GLN, 


GLU, THR 


MET 


LEU, 


ILE, 


VAL 


PHE 


LEU, 


TYR 




SER 


THR, 


ALA, 


ASN 


THR 


SER, 


ALA 




TRP 


ARG, 


SER 




TYR 


PHE 






VAL 


ILE, 


LEU ALA 


PRO 


ALA 
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SEQUENCE LISTING 

Sequence ID No 1 

Sequence ID No 2 
Sequence ID No 3 

Sequence ID No 4 
Sequence ID No 5 
Sequence ID No 6 
Sequence ID No 7 
Sequence ID No 8 
Sequence ID No 9 



corresponds to the amino acid 
sequence from position 23 to 345 
of the amino acid sequence 
illustrated in Figure 10. 

is the amino acid sequence 
illustrated in Figure 10. 

corresponds to the sequence from 
position 257 to 1291 of the 
nucleotide sequence illustrated 
in Figure 9. 

corresponds to the polynucleotide 
sequence of VEGFX1 illustrated in 
Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFX2 illustrated in 
Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFX3 illustrated in 
Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFX4 illustrated in 
Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFX5 illustrated in 
Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFX6 illustrated in 



- 53 - 



Sequence ID No 10 

5 

Sequence ID No 11 



10 

Sequence ID No 12 



Q 15 Sequence ID No 13 



^ Sequence ID No 14 

20 

3 

LTg Sequence ID No 15 

25 

Sequence ID No 16 



30 

Sequence ID No 17 



35 Sequence ID No 18 



Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFX7 illustrated in 
Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFX8 illustrated in 
Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFX9 illustrated in 
Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFX10 illustrated 
in Figure 3. 

corresponds to the polynucleotide 
sequence of VEGFXll illustrated 
in Figure 4. 

corresponds to the polynucleotide 
sequence of VEGFX12 illustrated 
in Figure 4 . 

corresponds to the polynucleotide 
sequence of VEGFX13 illustrated 
in Figure 4. 

corresponds to the polynucleotide 
sequence of VEGFX14 illustrated 
in Figure 4 . 

corresponds to the polynucleotide 
sequence 5'-l in Figure 8. 
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Sequence ID No 19 



Sequence ID No 20 



Sequence ID No 21 



Sequence ID No 22 



Sequence ID No 23 



Sequence ID No 24 



Sequence ID No 25 



Sequence ID No 26 



Sequence ID No 27 



Sequence ID No 28 



corresponds to the polynucleotide 
sequence 5 1 -2 in Figure 8 . 

corresponds to the polynucleotide 
sequence of VEGFX6 illustrated in 
Figure 13. 

corresponds to the polynucleotide 
sequence of VEGFX7 illustrated in 
Figure 13. 

corresponds to the polynucleotide 
sequence of VEGFX8 illustrated in 
Figure 13. 

corresponds to the polynucleotide 
sequence of VEGFX9 illustrated in 
Figure 13. 

corresponds to the polynucleotide 
sequence of VEGBAC1 illustrated 
in Figure 13. 

corresponds to the polynucleotide 
sequence of VEGBAC2 illustrated 
in Figure 13, 

corresponds to a polypeptide 
having the amino acid sequence 
from amino acid position 4 0 to 
150 of the sequence of Figure 10. 

corresponds to a polypeptide 
having the amino acid sequence 
illustrated in Figure 26, 

corresponds to the sequence from 
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position 5 to 508 of the 
nucleotide sequence illustrated 
in Figure 26, 

Sequence ID No 29 corresponds to the nucleotide 

sequence from position 5 to 508 • 
of the nucleotide sequence 
illustrated in Figure 26. 

Sequence ID No 30 corresponds to the sequence from 

position 214 to 345 of the 
nucleotide sequence illustrated 
in Figure 10. 



CLAIMS 



1. A nucleic acid molecule encoding a VEGF-X 
protein or a functional equivalent, derivative or 
bioprecursor thereof, said protein comprising any of 
the sequences from position 23 to 345 of the amino 
acid sequence illustrated in Figure 10, or the 
complete sequence as illustrated in Figure 10. 

2. A nucleic acid molecule according to claim 1 
wherein said nucleic acid is a DNA molecule. 

3. A nucleic acid molecule according to claim 1 
wherein said nucleic acid is a cDNA molecule. 

4. A nucleic acid molecule according to claim 3 
comprising the nucleotide sequence from position 257 
to 1291 of the nucleotide sequence illustrated in 
Figure 9, or sequences that hybridise thereto under 
high stringency conditions or the complement thereto, 

5. An antisense molecule capable of hybridising to 
a molecule according to claim 1 under high stringency 
conditions. 

6. A nucleic acid molecule according to claim 1 
which is of mammalian origin. 

7. A nucleic acid molecule according to claim 6 
which is of human origin. 

8. An isolated VEGE^-X protein, or a functional 
equivalent, derivative or bioprecursor thereof r 
having an amino acid sequence from position 23 to 345 
of the amino acid sequence illustrated in Figure 10 
or the complete amino acid sequence of Figure 10. 
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9. A VEGF-X protein, or a functional equivalent, 
derivative or bioprecusor thereof, encoded by a 
nucleic acid molecule as defined in claim 1. 

10. A protein according to claim 9, which comprises 
the amino acid sequence illustrated in Figure 10. 

11. An expression vector comprising a nucleic acid 
molecule according to claim 1. 

12. An expression vector according to claim 11 
further comprising a nucleotide sequence encoding a 
reporter molecule. 

13. An expression vector comprising an antisense 
molecule according to claim 5. 

14. A nucleic acid molecule according to claim 1 for 
use as a medicament, 

15. A host cell transformed or transfected with an 
expression vector according to claim 11 or 12. 

16. A host cell transformed or transfected with an 
expression vector according to claim 13. 

17. A transgenic cell, tissue or organism comprising 
a transgene capable of expressing a VEGF-X protein 
according to claim 8 or 9. 

18. A transgenic cell, tissue or organism according 
to claim 17, wherein said transgene is included in an 
expression vector. 

19. A VEGF-X protein or a functional equivalent, 
derivative or bioprecursor thereof expressed by a 
cell according to claim 15. 
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20. A VEGF-X protein, or a functional equivalent/ 
derivative or bioprecursor thereof, expressed by a 
transgenic cell, tissue or organism according to 
claim 17. 

21. A process for producing a VEGF-X protein 
according to claim 8, said process comprising 
transforming a host cell or organism with an 
expression vector according to claim 11, and 
recovering the expressed protein from said host cell 
or organism. 

22. An antibody capable of binding to a protein 
according to claim 8/ or an epitope thereof. 

23. An antibody according to claim 22 for use as a 
medicament . 

24. A pharmaceutical composition comprising an 
antibody according to claim 22 together with a 
pharmaceutical^ acceptable carrier diluent or 
excipient thereof . 

25. A method of identifying VEGF-X protein in a 
sample which method comprises contacting said sample 
with an antibody according to claim 22 and monitoring 
for binding of any protein to said antibody . 

26. A kit for identifying the presence of VEGF-X 
protein in a sample which comprises an antibody 
according to claim 22 and means for contacting said 
antibody with said sainple. 

21. A method of identifying compounds which modulate 
angiogenesis which method comprises providing a host 
cell or organism according to claim 15 or a 
transgenic cell, tissue or organism according to 
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claim 17, contacting a test compound with said cell, 
tissue or organism and monitoring for an effect of 
said compound on said VEGF compared to a host cell or 
organism according to claim 15 or a transgenic cell 
tissue or organism which has not been contacted with 
said compound, 

28. A compound identifiable according to the method 
of claim 27 . 

29. A compound according to claim 28 for use as a 
medicament . 

30. A nucleic acid sequence comprising the 
nucleotide sequences illustrated in any of Figures 3, 
5, 8 or 13. 

31- A method for producing a polypeptide , said 
method comprising the steps of: 

a) culturing the host cell of claim 15 under 
conditions suitable for expression of the 
polypeptide; and 

b) recovering the polypeptide from the host 
cell culture, 

32. A method of inhibiting angiogenic activity and 
inappropriate vascularisation including formation and 
proliferation of new blood vessels, growth and 
development of tissues, tissue regeneration and organ 
and tissue repair in a subject said method comprising 
administering to said' subject an amount of an 
antisense molecule according to claim 5 in sufficient 
concentration to reduce or prevent said angiogenic 
activity. 

33. A method of inhibiting angiogenic activity or 
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inappropriate vascularisation including any of 
formation and proliferation of new blood vessels, 
growth and development of tissues, tissue 
regeneration and organ and tissue repair in a subject 
said method comprising administering to said subject 
an amount of an antibody according to claim 22 in 
sufficient concentration to reduce or prevent said 
angiogenic activity or inappropriate vascularisation. 

34. A method of inhibiting angiogenic activity or 
inappropriate vascularisation including any of 
formation and proliferation of new blood vessels, 
growth and development of tissues, tissue 
regeneration and organ and tissue repair in a 
subject, said method comprising implanting in said 
subject cells that express an antibody according to 
claim 22. 

35. A method of treating or preventing any of 
cancer, rheumatoid arthritis, psoriasis and diabetic 
retinopathy, said method comprising administering to 
said subject an amount of an antisense molecule 
according to claim 5 in sufficient concentration to 
treat or prevent said disorders. 

36. A method of treating or preventing any of 
cancer, rheumatoid arthritis, psoriasis and diabetic 
retinopathy, said method comprising administering to 
said subject an amount of an antibody according to 
claim 22 in sufficient concentration to reduce or 
prevent said disorders. 

37. A method of promoting angiogenic activity or 
vascularisation to promote wound healing, skin graft 
growth, tissue repair, proliferation of new blood 
vessels, tissue regeneration and organ repair which 
method comprises applying or delivering to a site of 
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interest a therapeutically effective amount of any of 
a group selected from a protein according to claim 8 
and a nucleic acid molecule encoding a VEGF-X protein 
or a functional equivalent, derivative or 
5 bioprecursor thereof comprising an amino acid 

sequence illustrated in Figure 10, an expression 
vector comprising said nucleic acid molecule and a 
pharmaceutical composition comprising any of said 
nucleic acid molecule and said protein. 

10 

38, A method of treating wounds selected from the 
group consisting of dermal ulcers, pressure sores, 
venous sores, diabetic ulcers and burns by applying 
to said wound a therapeutically effective amount of 
15 any of a VEGF-X protein according to claim 8, a 

pharmaceutical composition comprising said protein 
and a pharmaceutically acceptable carrier, diluent or 
excipient therefor. 

20 39. A nucleic acid molecule encoding a polypeptide 
having a CUB domain said polypeptide comprising the 
amino acid sequence from position 40 to 150 of the 
sequence of Figure 10. 

25 40* A nucleic acid molecule encoding a polypeptide 
having a CUB domain, said polypeptide comprising the 
amino acid sequence of Figure 26. 

41. A nucleic acid molecule according to claim 40, 
30 comprising the nucleotide sequence from position 5 to 

508 of the sequence illustrated in Figure 26. 

42. A nucleic acid molecule according to claim 41 
comprising the nucleotide sequence illustrated in 

35 Figure 26. 



43. A nucleic acid molecule encoding a VEGF like 
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domain comprising the sequence from position 214-345 
of the sequence of Figure 10 or the sequence from 
position 15 to 461 illustrated in Figure 24, 

44. An expression vector comprising a nucleic acid 
molecule according to claim 39 or 40. 

45. An expression vector comprising a nucleic acid 
molecule according to claim 43. 

4 6. A host cell transformed or transfected with an 
expression vector according to claim 44. 

47 . A host cell transformed or transfected with an 
expression vector according to claim 45. 

48. A protein expressed by the cell according to 
claim 46. 

49. A protein expressed by the cell according to 
claim 47. 

50. A method of identifying compounds that inhibit 
or enhance angiogenic activity, said method 
comprising contacting a cell expressing a VEGF 
receptor and/or a neuropilin 1 or 2 type receptor 
with said compound in the presence of a VEGF-X 
protein according to claim 8 and monitoring for the 
effect of said compound or said cell when compared to 
a cell which has not been contacted with said 
compound. 

51. A compound identifiable according to the method 
of claim 50 as an inhibitor or enhancer of angiogenic 
activity. 

52. A method of inhibiting angiogenic activity or 
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inappropriate vascularisation, said method comprising 
contacting a cell expressing a VEGF receptor and a 
neuropilin type receptor with a protein selected from 
any of a protein according to claim 8 and a protein 
according to claim 48 or a protein according to claim 
49. 

53. Use of a nucleotide sequence illustrated in any 
of Figures 14 and 15 in identifying a VEGF-X protein 
according to claim 8. 

54. A nucleic acid molecule encoding a polypeptide 
comprising a CUB domain having the sequence from 
position 40 to 150 of the sequence of Figure 10 or 
from position 5 to 508 of the sequence of Figure 26 
and a sequence encoding a VEGF domain, 

55. A nucleic acid molecule according to claim 54 
wherein said sequence encoding said VEGF domain is 
selected from the sequences encoding any of VEGF A to 
D or isoforms or variants thereof, 

56. A nucleic acid molecule encoding a polypeptide 
comprising the amino acid sequence from position 40 
to 150 of the sequence illustrated in Figure 10 for 
use as a medicament. 

57. Use of a nucleic acid molecule encoding a 
polypeptide having the amino acid sequence from 
position 40 to 150 of the sequence illustrated in 
Figure 10 in the manufacture of a medicament for 
treatment of disease conditions associated with 
inappropriate angiogenesis such as tumour or cancer 
growth, retinopathy/ osteoarthritis or psoriasis. 

58. A polypeptide comprising the amino acid sequence 
from position 40 to 150 of the sequence illustrated 
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in figure 10 for use as a medicament. 

59. A polypeptide comprising the amino acid sequence 
from position 40 to 150 of the sequence illustrated 
in Figure 10 in the manufacture of a medicament for 
the treatment of disease conditions associated with 
inappropriate angiogenesis such as tumour growth, 
retinopathy, osteoarthritis or psoriasis. 

60. Use of a COB domain comprising the amino acid 
sequence from position 40 to 150 of the sequence of 
Figure 10, or the amino acid sequence of Figure 26, 
to identify compounds which inhibit angiogenic 
activity in a method according to claim 50. 

61. A method of inhibiting angiogenic activity and 
inappropriate vascularisation including formation and 
proliferation of new blood vessels, growth and 
development of tissues, tissue regeneration and organ 
and tissue repair in a subject said method comprising 
administering to said subject an amount of a 
polypeptide having an amino acid sequence from 
position 40 to 150 of the sequence illustrated in 
Figure 10 or a nucleic acid molecule according to any 
of claims 39 to 42 in sufficient concentration to 
reduce or prevent said angiogenic activity. 

62. A method of treating or preventing any of 
cancer, rheumatoid arthritis, psoriasis and diabetic 
retinopathy, said method comprising administering to 
said subject an amount of a polypeptide having an 
amino acid sequence from position 40 to 150 of the 
sequence illustrated in Figure 10 or a nucleic acid 
molecule according to any of claims 39 to 42 in 
sufficient concentration to treat or prevent said 
disorders. 
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63. An antisense molecule capable of hybridising to 
a molecule according to claim 39 under high 
stringency conditions. 

5 64. An antisense molecule capable of hybridising to 
a molecule according to claim 43 under high 
stringency conditions. 

65. A transgenic cell, tissue or organism comprising 
10 a transgene capable of expressing a protein according 

to claim 48. 

66. A transgenic cell, tissue or organism comprising 
a transgene capable of expressing a protein according 

15 to claim 49. 

67. A transgenic, cell tissue or organism 
according to claim 65 or 66, wherein said transgene 
is included in an expression vector according to 

20 claim 41 or 42. 

68. An antibody capable of binding to a protein 
according to claim 48 or an epitope thereof. 

25 69. An antibody capable of binding to a protein 
according to claim 4 9 or an epitope thereof. 

70. A pharmaceutical composition comprising an 
antibody according to claim 68 or 69 together with a 

30 pharmaceutical^ acceptable carrier diluent or 
excipient therefor. 

71. A pharmaceutical composition comprising a 
compound according to claim 48 together with a 

35 pharmaceutically acceptable carrier, diluent or 
excipient therefor. 
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72- A nucleic acid molecule encoding a variant 
VEGF-X protein having any of the sequences of 
nucleotides illustrated in Figure 12. 
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ABSTRACT 



VASCULAR ENDOTHELIAL GROWTH FACTOR-X 



There is provided a novel vascular endothelial growth 
factor, herein designated VEGF-X, in addition to the 
nucleic acid molecule encoding it, a host cell 
transformed with said vector and compounds which 
inhibit or enhance angiogenesis. Also provided is the 
sequence of a CUB domain present in the sequence of 
VEGF-X which domain itself prevents angiogenesis and 
which is used to treat diseases associated with 
inappropriate vascularisation or angiogenesis. 
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Figures 
Figure 1. 



1 AAAATGTATG GATACAACTT ACGTTTGATG AAAGATTTGG GCTTGAAGAC CCAGAAGATG 
TTTTACATAC CTATGTTGAA TGCAAACTAC TTTCTAAACC CGAACTTCTG GGTCTTCTAC 

61 ACATATGCAA GTATGATTTT GTAGAAGTTG AGGAACCCAG TGATGGAACT ATATTAGGGC 
TGTATACGTT CATACTAAAA CATCTTCAAC TCCTTGGGTC ACTACCTTGA TATAATCCCG 

121 GCTGGTGTGG TTCTGGTACT GTACCAGGAA AACAGATTTC TAAAGGAAAT CAAATTAGGA 
CGACCACACC AAGACCATGA CATGGTCCTT TTCTCTAAAG ATTTCCTTTA GTTTAATCCT 

+1 MetAsn IlePheLeu LeuAsnLeuLeu ThrGluGlu ValArgLeu 

] - 

131 TAAGATTTGT ATCTGATGAA TATTTTCCTT CTGAACCTTC TAACAGAGGA GGTAAGATTA 
ATTCTAAACA TAGACTACTT ATAAAAGGAA GACTTGGAAG ATTGTCTCCT CCATTCTAAT 

+1 TyrSerCysThr ProArgAsn PheSerVal SerIl«ArgGlu GluLauLys ArgThrAsp 



241 TACAGCTGCA CACCTCGTAA CTTCTCAGTG TCCATAAGCC AAGAACTAAA GAGAACCGAT 
ATGTCGACG? GTGGAGCATT GAAGAGTCAC AGGTATTCCC TTCTTGATTX CTCTTGGCTA 

+1 ThrllerheTrp ProGlyCys LeuLeuVal LysArgCysGly GlyAanCys AlaCysCys 



301 ACCATTTTCt GGCCAGGTTG TCTCCTGGTT AAACGCTGTG GTGGGAACTG TGCCTGTTGT 
TGGTAAAAGA CCGGTCCAAC AGAGGACCAA TTTGCGACAC CACCCTTGAC ACGGACAACA 

*1 LeuHisAsnCys AsnGluCys GlnCysVal ProSerLysVal ThrLysLys TyrHisGlu 



3 61 CTCCACAATT GCAATGAATG TCAATGTGTC CCAAGCAAAG TTACTAAAAA ATACCACGAG 
GAGGTGTTAA CGTTACTTAC AGTTACACAG GGTTCGTTTC AATGATTTTT TATGGTGCTC 

+1 ValleuGlnLeu ArgProLys ThrGlyVal ArgGlyLeuHis LysSerLeu ThrAspVal 



421 GTCCTTCAGT TGAGACCAAA GACCGGTGTC AGGGGATTGC ACAAATCACT CACCGACGTG 
CAGGAAGTCA ACTCTGGTTT CTGGCCACAG TCCCCTAACG TGTTTAGTGA GTGGCTGCAC 

+1 AlaLeuGluHis HisGluGlu CysAspCys ValCysArgGly SerThrGly Gly 



461 GCCCTGGAGC ACCATGAGGA GTGTGACTGT GTGTGCAGAG GGAGCACAGG AGGATAGCCG 
CGGGACCTCG TGGTACTCCT CACACTGACA CACACGTCTC CCTCGTGTCC TCCTATCGGC 

541 CATCACCACC AGCAGCTCTT GCCCAGAGCT GTGCAGTGCA GTGGCTGATT CTATTACAGA 
GTAGTGGTGG TCGTCGAGAA CGGGTCTCGA CACGTCACGT CACCGACTAA GATAATCTCT 

601 ACGTATGCGT TATCTCCATC CTTAATCTCA GTTGTTTGCT TCAAGGACCT TTCATCTTCA 
TGCATACGCA ATAGACGTAG G AA TT AG ACT CAACAAACGA AGTTCCTGGA AAGTAGAAGT 



661 GGATTTACAG TGCATTCTGA AAGAGGAGAC ATCAAACAGA ATTAGGAGTT GTGCAACAGC 
CCTAAATGTC ACGTAAGACT TTCTCCTCTG TAGTTTGTCT TAATCCTCAA CACGTTGTCC 
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721 TCTTTTGAGA GGAGGCCTAA AGGACAGGAG AAAAGGTCTT CAATCGTGGA AAGAAAATTA 
AGAAAACTCT CCTCCGGATT TCCTGTCCTC TTTTCCAGAA GTTAGCACC? TTCTTTTAAT 

781 AATGTTGTAT TAAATAGATC ACCAGCTAGT TTCAGAGTTA CCATGTACGT ATTCCACTAG 
TTACAACATA ATTTATCTAG TGGTCGATCA AAGTCTCAAT GGTACATGCA TAAGGTGATC 

841 CTGGGTTCTG TATTTCAGTT CTTTCGATAC GCCTTAGGGT AATGTCAGTA CAGGAAAAAA 
GACCCAAGAC ATAAAGTCAA GAAAGCTATG CCGAATCCCA TTACAGTCAT GTCCTTTTTT 

901 ACTGTGCAAG TGAGCACCTG ATTCCGTTGC CTTGCTTAAC TCTAAAGCTC CATGTCCTGG 
TGACACGTTC ACTCGTGGAC TAAGGCAACG GAACGAATTC AGATTTCGAG GTACAGGACC 

961 GCCTAAAATC GTATAAAATC TGCSATTTTTT TTTTTTTTTT TGCTCATATT CACATATGTA 
CGGAT7TTAG CATATTTTAG ACCTAAAAAA AAAAAAAAAA ACGAGTATAA GTGTATACAT 

1021 AACCAGAACA TTCTATGTAC TACAAACCTG GTTTTTAAAA AGGAACTATG TTGCTATGAA 
TTGGTCTTGT AAGATACATG ATGTTTGGAC CAAAAATTTT TCCTTGATAC AACGATACTT 

1081 TTAAACTTGT GTCGTGCTGA TAGGACAGAC TGGATTTTTC ATATTTCTTA TTAAAATTTC 
AATTTGAACA CAGCACGACT ATCCTGTCTG ACCTAAAAAG TATAAAGAAT AATTTTAAAG 

1141 TGCCATTTAG AAGAAGAGAA CTACATTCA? GGTTTGGAAG AGATAAACCT GAAAAGAAGA 
ACGGTAAATC TTCTTCTCTT GATGTAAGTA CCAAACCTTC TCTATTTGGA CTTTTCTTCT 

1201 GTGGCCTTAT CTTCACTTTA TCGATAAGTC AGTTTATTTG TTTCATTGTG TACATTTTTA 
CACCGGAATA GAAGTGAAAT AGCTATTCAG TCAAATAAAC AAAGTAACAC ATGTAAAAAT 

1261 TATTCTCCTT TTGACATTAT AACTGTTGGC TTTTCTAATC TTGTTAAATA TATCTATTTT 
ATAAGAGGAA AACTGTAATA TTGACAACCG AAAAGATTAG AACAATTTAT ATACATAAAA 

1321 TACCAAAGGT ATTTAATATT CTTTTTTATG ACAACTTAGA TCAACTATTT TTAGCTTGGT 
ATGGTTTCCA TAAATTATAA GAAAAAATAC TGTTGAATCT AGTTGATAAA AATCGAACCA 

1381 AAATTTTTCT AAACACAATT GTTATAGCCA GAGGAACAAA GATGATATAA AATATTGTTG 
TTTAAAAAGA TTTGTGTTAA CAA/TATCGGT CTCCTTGTTT CTACTATATT TTA7AACAAC 

1441 CTCTGACAAA AATACATGTA TTTCATTCTC GTATGGTGCT AGAGTTAGAT TAATCTGCAT 
GAGACTGTTT TTATGTACAT AAAGTAAGAG CATACCACGA TCTCAATCTA ATTAGACGTA 

1501 TTTAAAAAAC TGAATTGGAA TAGAATTGGT AAGTTGCAAA GACTTTTTCA AAATAATTAA 
AAATTTTTTG ACTTAACCTT ATCTTAACCA TTCAACGTTT CTGAAAAACT TTTATTAATT 

1561 ATTATCATAT CTTCCATTCC TGTTATTGGA GATGAAAATA AAAAGCAACT TATGAAAGTA 
TAATAGTATA GAAGGTAAGG ACAATAACCT CTACTTTTAT TTTTCGTTGA ATACTTTCAT 

1621 GACATTCAGA TCCAGCCATT ACTAACCTAT TCCTTTFTTG GGGAAATCTG AGCCTAGCTC 
CTGTAAG7CT AGGTCGGTAA TGATTGGATA AGGAAAAAAC CCCTTTAGAC TCGGATCGAG 

1681 AGAAAAi.CAT AAAGCACCTT GAAAAAGACT TGGCAGCTTC CTGATAAAGC GTCCTGTGC7 
TCTTTTTGTA TTTCGTGGAA CTTTTTCTGA ACCGTCGAAG GACTATTTCG CACGACACGA 

1741 GTGCAGTAGG AACACATCCT ATTTATTGTG ATGTTGTGGT TTTATTATCT TAAACTCTGT 
CACGTCATCC TTGTGTAGGA TAAATAACAC TACAACACCA AAATP.ATAGA ATTTGAGACA 

1801 TCCATACACT TGTATAAATA CATGGATATT TTTATGTACA GAAGTATGTC TCTTAACCAG 
AGGTATGrGA ACATATTTAT GTACCTATAA AAATACATGT CTTCATACAG AGAATTGGTC 





1861 TTCACTTATT GTACCTGG 
AAGTGAXTAA CATGGACC 
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Figure 2, Predicted VEGF-fike protein encoded by Incyte contig of 8/12/98 

1 MNIFLLNLCT EEVRLYSCTP KNFSVSIREE LXRTDTIFWP GCLLVXRCGG 

51 NCACCLHNCN BCQCVPSKVT KKYHEVLQLR PXTGVRGLHK SLTDVALBKH 

ZOX EECDCVCRGS TGG 
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Figure 3. PCR primers for cloning VEGF-X 
vegfXl AAAATGTATGGATACAACTTAC 

vegfX2 GTTTGATGAMGATTTGGGCTTG 

vegfX3 TTTCTAAAGGAAATCAAATTAG 

vegfX4 GATAAGATTTGTATCTGATG 

vegfX5 GATGTCTCCTCTTTCAG 

vegfX6 GCACAACTCCTAATTCTG 

vegfX7 AGCACCTGATTCCGTTGC 

vegfX8 TAGTACATAGAATGTTCTGG 

vegfX9 AAGAGACATACTTCTGTAC 

vegfXlO CCAGGTACAATAAGTGAACTG 



Express Mail Label No. 

EL024661737US 

Date of Deposit: 12/21/99 



Figure 4. Variants Isolated by PCR 



a b c d .e f 

PCR primers- -> -> «_ <_ > . 



Incytecontig i t ^^T" 
(8/12/98) 



clone 22, 29, 41 C 



clone 52, 59 



clones 57, 25, C 
26. 27 



2. 1kb clones 1, 
2.3 



clone 1 5. 20 | ~^ /N> t 



primers- a-vegfX1 b-vegfX2 c-veg/X5 

(see fig 3) d-vegfX6 e-vegfX9 f-vegfX10 
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Figure 5. VEGF-X 5' RACE primers 



vegfXl 1 



CCTTTAGAAATCTGTTTTCCTGGTACAG 



vegfXl2 



GGAAAATATTCATCAGATACAAATCTTATCC 



vegfXl3 



GGTCCAGTGGCAAAGCTGAAGG 



vegfXl4 



CTGGTTCAAGATATCGAATAAGGTCTTCC 
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Figure 6, DNA sequence assembled from in-house clones and 5'RACE 



1 TGCCAGAGCA GGTGGGCGCT TCCACCCCAG TGCAGCCTTC CCCTGGCGGT GGTGAAAGAG 
ACGGTCTCGT CCACCCGCGA AGGTGGGGTC ACGTCGGAAG GGGACCGCCA CCACTTTCTC 

51 ACTCGGGAGT CGCTGCTTCC AAAGTGCCCG CCGTGAGTCA GCTCTCACCC CAGTCAGCCA 
TGAGCCCTCA GCGACGAAGG TTTCACGGGC GGCACTCACT CGAGAGTGGG GTCAGTCGGT 

+2 KetSdrLeu PheGlyLeuLeu LeuLeuThr SerAlaLeu AlaGlyGlnArg GlrvGlyTh 

J- - - 

121 AATGAGCCTC TTCGGGCTTC TCCTGCTGAC ATCTGCCCTG GCCGGCCAGA GACAGGGGAC 
TTACTCGGAG AAGCCCGAAG AGGACGACTG TAGACGGGAC CGGCCGGTCT CTGTCCCCTG 

+2 rGlnAIaGlu SerAsnLeuSer SerLysPho GlnPheSer SerAsnLysGlu GlnAsnGl 



161 TCAGGCGGAA TCCAACCTGA GTAGTAAATT CCAGTTTTCC AGCAACAAGG AACAGAACGG 
AGTCCGCCTT AGGTTGGACT CATCATTTAA GGTCAAAAGG TCGTTGTTCC TTGTCTTGCC 

+2 yValGlnAs? ProGlnHisGlu Argil ell g ThrValSer ThrAsnGlySer IleHisSa 



241 AGTACAAGA? CCTCAGCATG AGAGAATTAT TACTGTGTCT ACTAATGGAA GTATTCACAG 
TCATGtTCTA GGAGTCGTAC TCTCTTAATA ATGACACAGA TGATTACCTT CATAAGTGTC 

+2 rProArjPhe ProHisThrTyr ProArgAsn ThrValLeu ValTrpArgLeu V&lAlaVa 



301 CCCAAGGTT? CCTCATACTT ATCCAAGAAA TACGGTCTTG GTATGGAGAT TAGTAGCAGT 
GCGTTCCAAA GGAGTATGAA TAGGTTCTTT ATGCCAGAAC CATACCTCTA ATCATCGTCA 

+2 lGluGIuAsn ValTrpIleGln LeuThrPhe AspGluArg PheGlyLeuGlu AspProGl 



361 AGAGGAAAA? GTATGGATAC AACTTACGTT TGATGAAAGA WTGGGCTTG AAGACCCAGA 
TCTCCTt-TTA CATACCTATG TTGAATGCAA ACTACTTTCT AAACCCGAAC TTCTGGGTCT 

+2 uAspAscIle CysLysTyrAsp PheValGlu ValGluGlu ProSerAspGly ThrlleLe 



421 AGATGACA2A TGCAAGTATG ATTTTGTAGA AGTTGAGGAA CCCAGTGATG GAACTATATT 
TCTACTGTAT ACGTTCATAC TAAAACATCT TCAACTCCTT GGGTCACTAC CTTGATATAA 

+2 uGlyArcTrp CysGlySerGly ThrValPro GlyLysGln IleSerLysGly AsnGlnll 



481 AGGGCGC7GG TGTGGTTCTG GTACTGTACC AGGAAAACAG ATTTCTAAAG GAAATCAAAT 
TCCCGCCACC ACACCAAGAC CATGACATGG TCCTTTTGTC TAAAGATTTC CTTTAGTTTA 

+2 eArglleArg PheValSerAsp GluTyrPho ProSerGlu ProGlyPheCys IleKisTy 



541 TAGGACAAGA TCTGTATCTG ATGAATATTT TCCTTCTGAA CCAGGGTTCT GCATCCACTA 
ATCCTATTCT AAACATAGAC TACTTATAAA AGGAAGACTT GGTCCCAAGA CGTAGGTGAT 

♦2 rAsnlltVal M*£?r©GlnPhe ThrGluAla ValS«rPro SerValLeuPr© ProSerAl 



601 CAACATTGTC ATGCCACAAT TCACAGAAGC TGTGAGTCCT TCAGTGCTAC CCCCTTCAGC 
GTTGTAACAG TACGGTGTTA AGTGTCTTCG ACACTCAGGA AGTCACGATG GGGGAAGTCG 

+2 aLeuPrc-eu As?^eaL*uAsrv AsnAlalle ThrAlaPhe SerThrLeuGlu AspLeuIl 
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661 TTTGCCACTG GACCTGCTTA ATAATGCTAT AACTGCCTTT AGTACCTCGG AAGACCTTAT 
AAACGGTGAC CTGGACGAAT TATTACGATA TTGACGGAAA TCATCCAACC TTCTCGAATA 

+2 eArgTyrL^u GluProGluArg TrpGlnLeu AspLeuGlu AspLeuTyrArg ProThrTr 



721 TCGATATCTT GAACCAGAGA GATGGCAGTT GGACTTAGAA GAT CT AT AT A GGCCAACTTG 
AGCTATAGAA CTTGGTCTCT CTACCGTCAA CCf GAATCTT CTAGATATAT CCGGTTGAAC 

+2 pGlnLeuLeu GlyLysAlaPhe ValPheGly ArgLysSer ArgValValAsp LeuAsnLe 



761 CCAACTTCTT GGCAAGGCTT TTGTTTTTGG AAGAAAATCC AGAGTGGTGG ATCTGAACCT 
CGTTGAAGAA CCGTTCCGAA AACAAAAACC TTCTTTTAGG TCTCACCACC TAGACTTGGA 

+2 uLeuThrGlu GluValArgLau TyrSerCys ThrProArg AsnPheSerVal SerlleAr 



841 TCTAACAGAG GACGTAAGAT TATACAGCTG CACACCTCGT AACTTCTCAG TGTCCATAAG 
AGATTGTCTC CTCCATTCTA ATATGTCGAC GTGTGGAGCA TTGAAGAGTC ACAGGTATTC 

+2 gGlvGluLeu LysArgThrAsp ThrllePhe TrpProGly CysLeuLeuVal LysArgCy 



901 GGAAGAACTA AAGAGAACCG ATACCATTTT CTGGCCAGGT TGTCTCCTGG TTAAACGCTC 
CCTTCTTGAT TTCTCTTGGC TATGGTAAAA GACCGGTCCA ACAGAGGACC AATTTGCGAC 

+2 sGlyGlyAsn CysAlaCyaCys LeuHisAsn CysAsr.Glu CysGlnCysVal PxoSerLy 



961 TGGTGGGAAC TGTGCCTGTT GTCTCCACAA TTGCAATGAA TGTCAATGTG TCCCAAGCAA 
ACCACCCTTG ACACCGACAA CAGAGGTGTT AACGTTACTT ACAGTTACAC AGGGTTCGTT 

+2 sValThrLys LysTyrHisGlu ValLeuGln LeuArgPro LysThrGlyVal ArgGlyLe 



1021 AGTTACTAAA AAATACCACG AGGTCCTTCA GTTGAGACCA AAGACCGGTG TCAGGGGATT 
TCAATGATTT TTTATGGTGC TCCAGGAAGT CAACTCTGGT TTCTGGCCAC AGTCCCCTAA 

+2 uHisLysSer LeuThrAspVal AlaLeuGlu HisHisGlu GluCysAspCys ValCysAr 



1081 GCACAAATCA CTCACCGACG TGGCCCTGGA GCACCATGAG GAGTGTGACT GTGTGTGCAG 
CGTGTTTAC? GAGTGGCTGC ACCGGGACCT CGTGGTACTC CTCACACTGA CACACACGTC 

+2 gGlySerThr GlyGly 



1141 AGGGAGCACA GGAGGATAGC CGCATCACCA CCAGCAGCTC TTGCCCAGAG CTGTGCAGTG 
TCCCTCGTGT CCTCCTATCG GCGTAGTGGT GGTCGTCGAG AACGGGTCTC GACACGTCAC 

1201 CAGTGGCTGA TTCTATTAGA GAACGTATGC GTTATCTCCA TCCTTAATCT CAGTTGTTTG 
GTCACCGACT AAGATAATCT CTTGCATACG CAATAGAGGT AGGAATTAGA GTCAACAAAC 

1261 CTTCAAGGAC CTTTCATCTT CAGGATTTAC AGTGCATTCT GAAAGAGGAG ACATCAAACA 
GAAGTTCCTG GAAAGTAGAA GTCCTAAATG TCACGTAAGA CTTTCTCCTC TGTAGTTTGT 

1321 GAATTAGGAG TTGTGCAACA GCTCTTTTGA GAGGAGGCCT AAACGACAGG AGAAAAGGTC 
CTTAATCC7C AACACGTTGT CGAGAAAACT CTCCTCCGGA TTTCCTGTCC TCTTTTCCAG 

1381 TTCAATC3TC GAAAGAAAAT TAAATGTTGT ATTAAATAGA TCACCAGCTA CTTTCAGAGT 
AAGTTASrAC CTTTCTTTTA ATTTACAACA TAATTTATCT AGTGGTCCAT CAAAGTCTCA 

1441 TACCATGTAC GTATTCCACT AGCTGGGTTC TGTATTTCAG TTCTTTCGAT ACGCCTTAGG 
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ATGGTACATG CATAAGGTGA TCGACCCAAG ACATAAAGTC AAGAAAGCTA TGCCGAATCC 

15 OX GTAATGTCAG TACAGGAAAA AAACTGTGCA AGTGAGCACC TGATTCCGTT GCCTTGCTTA 
CATTACAGTC ATGTCCTTTT TTTGACACGT TCACTCGTGG ACTAAGGCAA CGGAACGAAT 

1561 ACTCTAAAGC TCCATGTCCT GGGCCTAAAA TCGTATAAAA TCTGGATTOT TTTTTTTTTT 
TGAGATTTCG AGGTACAGGA CCCGGATTTT AGCATATTTT AGACCTAAAA AAAAAAAAAA 

1621 TTTGCTCATA TTCACATATG TAAACCAGAA CATTCTATGT ACTACAAACC TGGTTTXTAA 
AAACGAGTAT AAGXGTATAC ATTTGGTCTT GTAAGATACA TGATGTTTGG ACCAAAAATT 

1631 AAAGGAACTA TGTTGCTATG AATTAAACTT GTGTCGTGCT GATAGGACAG ACTGGATTTT 
TTTCCT7GAT ACAACGATAC TTAATTTGAA CACAGCACGA CTATCCTGTC TGACCTAAAA 

1741 TCATATTTCT TATTAAAATT TCTGCCATTT AGAAGAAGAG AACTACATTC ATGGTMGGA 
AGTATAAAGA ATAATTTTAA AGACGGTAAA TCTTCTTCTC TTGATGTAAG TACCAAACCT 

1801 AGAGATAAAC CTGAAAAGAA GAGTGGCCTT ATCTTCACTT TATCGATAAG CCAGTTTATT 
TCTCTATTTG GACTTTTCTT CTCACCGGAA TAGAAGTGAA ATAGCTATTC GGTCAAATAA 

1861 TGTTTCATTG TGTACATTTT TATATTCTCC TTTTGACATT ATAACTGTTG GCTTTTCTAA 
ACAAAGTAAC ACATGTAAAA ATATAAGAGG AAAACTGTAA TATTGACAAC CGAAAAGATT 

1921 TCTTGTTAAA TATATCTATT TTTACCAAAG GTATTTAATA TTCTTTTTTA TGACAACTTA 
AGAACAAtTT ATATAGATAA AAATGGTTTC CATAAATTAT AAGAAAAAAT ACTGTTGAAT 

1981 GATCAACTAT TTTTAGCTTG GTAAATTTTT CTAAACACAA TTGTTATAGC CAGAGGAACA 
CTAGTTGATA AAAATCGAAC CATTTAAAAA GATTTGTGTT AACAATATCG GTCTCCTTGT 

2041 AAGATGAIAT AAAATATTGT TGCTCTGACA AAAATACATG TATTTCATTC TCGTATGGTG 
TTCTACTATA TTTTATAACA ACGAGACTGT TTTTATGTAC ATAAAGTAAG AGCATACCAC 

2101 CTAGAGTTAG ATTAATCTGC ATTTTAAAAA ACTGAATTGG AATAGAATTG GTAAGTTGCA 
GATCTCAATC TAATTAGACG TAAAATTTTT TGACTTAACC TTATCTTAAC CATTCAACGT 

2161 AAGACTT-TT GAAAATAATT AAATTATCAT ATCTTCCATT CCTGTTATTG GAGATGAAAA 
TTCTGAAAAA CTTTTATTAA TTTAATAGTA TAGAAGGTAA GGACAATAAC CTCTACTTTT 

2221 TAAAAAGCAA CTTATGAAAG TAGACATTCA GATCCAGCCA TTACTAACCT ATTCCTTTTT 
ATTTTTCGTT GAATACTTTC ATCTGTAAGT CTAGGTCGGT AATGATTGGA TAAGGAAAAA 

2281 TGGGGAAATC TGAGCCTAGC TCAGAAAAAC ATAAAGCACC TTGAAAAAGA C7TGGCAGCT 
ACCCCTTTAG ACTCGGATCG AGTCTTTtTG TATTTCGTGG AACTTTTTCT GAACCGTCGA 

2341 TCCTGATAAA GCGTGCTGTG CTGTGCAGTA GGAACACATC CTATTTATTG TGATGTTGTG 
AGGACTATTT CGCACGACAC GACACGTCAT CCTTGTGTAG GATAAATAAC ACTACAACAC 

2401 GTTTTATCAT CTTAAACTCT GTTCCATACA CTTGTATAAA TACATGGATA TTTTTATGTA 
CAAAATAATA GAATTTGAGA CAAGGTATGT GAACATATTT ATGTACCTAT AAAAATACAT 

2461 CAGAAGTATG 7CTCT 
GTCTTCA7AC AGAGA 
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Figure 7. 



New Sequence + Incyte ESTs 



1 ATTTGTTTAA ACCTTGGGAA ACTGGTTCAG GTCCAGGTTT TGCTTTGATC CTTTTCAAAA 
TAAACAAATT TGGAACCCTT TGACCAAGTC CAGGTCCAAA ACGAAACTAG GAAAAGTTTT 

61 ACTGGAGACA CAGAAGAGGG CTTCTAGGAA AAAGTTTTGG GATGGGATTA TGTGGAAACT 
TGACCTCTGT GTCTTCTCCC GAAGATCCTT TTTCAAAACC CTACCCTAAT ACACCTTTGA 

121 ACCCTGCGAT TCTCTGCTGC CAGAGCAGGC TCGGCGCTTC CACCCCAGTG CAGCCTTCCC 
TGGGACGCTA AGAGACGACG GTCTCGTCCG AGCCGCGAAG GTGGGGTCAC GXCGGAAGGG 

181 CTGGCGGTGG TGAAAGAGAC TCGGGAGTCG CTGCTTCCAA AGTGCCCGCC GTGAGlGAGC 
GACCGCCACC ACTTTCTCTG AGCCCTCAGC GACGAAGGTT TCACGGGCGG CACTCACTCG 

+2 Met SerLeuPhe GlyLeuLcu LeuLeuThrSer AlaLeviAl 

1— — 

241 TCTCACCCCA GTCAGCCAAA TGAGCCTCTT CGGGCTTCTC CTGCTGACAT CTGCCCTGGC 
AGAGTGGGGT CAGTCGGTTT ACTCGGAGAA GCCCGAAGAG GACGACTGTA GACGGGACCG 

+2 aGlyGlr-Arg GlnGlyThrGln AlaGluSer AsnLeuSer SerLysPheGln PheSerSe 



301 CGGCCAGAGA CAGGGGACTC AGGCGGAATC CAACCTGAGT AGTAAATTCC AGTTTTCCAG 
GCCGGTCTCT GTCCCCTGAG TCCGCCTTAG GTTGGACTCA TCATTTAAGG TCAAAAGGTC 

♦ 2 rAcnLysGlu GlnTyrGlyVal GlnAspPro GlnHieGlu ArgllelleThr ValSerTh 



361 CAACAAGGAA CAGTACGGAG TACAAGATCC TCAGCATGAG AGAATTATTA CTGTGTCTAC 
GTTGTTCCTT GTCATGCCTC ATGTTCTAGG AGTCGTACTC TCTTAATAAT GACACAGATG 

♦2 rAsrvGlySer IleHisSerPro ArgPhePro HisThrTyr ProArgAsnThr ValLcuVa 



421 TAATGGA%GT ATTCACAGCC CAAGGTTTCC TCATACTTAT CCAAGAAATA CGGTCTTGGT 
ATTACCTTCA TAAGTGTCGG GTTCCAAAGG AGTATGAATA GGTTCTTTAT GCCAGAACCA 

*2 lTrpArgL«u ValAl&ValGlu GluAsnVal TrpIleGln LeuThrPheAsp GluArgPh 



481 ATGGAGATTA GTAGCAGTAG AGGAAAATGT ATGGATACAA CTTACGTTTG ATGAAAGATT 
TACCTCTAAT CAT CGT CATC TCCTTTTACA TACCTATGTT GAATGCAAAC TACTTTCTAA 

♦2 «GlyLeuGlu AspProGluAsp AspIleCye LysTyrAsp PheValGluVal GluGluPr 



541 TGGGCTT3AA GACCCAGAAG ATGACATATG CAAGTATGAT TTTGTAGAAG TTGAGGAACC 
ACCCGAACTT CTGCGTCTTC TACTGTATAC GTTCATACTA AAACATCTTC AACTCCTTGG 

+2 oSerAspC-iy ThrlleLeuGly ArgTrpCys GlySerGly ThrValProGly LysGlnll 



601 CAGTGATGGA ACTATATTAG GGCGCTGGTG TGGTTCTGGT ACTGTACCAG GAAAACAGAT 
GTCACTACCT TGATATAATC CCGCGACCAC ACCAAGACCA TGACATGGTC CTTTTGTCTA 

+2 eSerLysSly AsnGlnXlcJirg Xl«ArgPh* ValSerAsp CluTyrPhcPr© SerGluP* 



661 TTCTAAAGGA AATCAAATTA GGATAAGATT TGTATCTGAT GAATATTTTC CTTCTGAACC 
AAGATTTCCT TTAGTTTAAT CCTATTCTAA ACATAGACTA CTTATAAAAG GAAGACTTGG 
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+2 oGlyPhecys IlaHisTyrAan XleValMot JProGlnPhe XhrCluAlaVal SerProS* 



721 AGGGTTCTGC ATCCACTACA ACATTGTCAT GCCACAATTC ACAGAAGCTG TGAGTCCTTC 
TCCCAAGACG TAGGTGATGT XGTAACAGTA CGGTGTTAAG XGXCTTCGAC ACTCAGGAAG 

+2 rValLeuPro ProSerAlaLeu ProLeuAsp LcuLeuAsn AsnAlalleThr AlaPheSe 



781 AGTGCTACCC CCTTCAGCTT TGCCACTGGA CCTGCTTAAT AATGCTATAA CTGCCTTTAG 
ICACGAXGGG GGAAGTCGAA ACGGTGACCX GGACGAATTA TTACGATATT GACGGAAAXC 

+2 rThrLeuGlu AspLeuIleArg XyrLeuGlu ProGluArg TrpGlnLeuAsp L«uGluAs 



641 TACCXTSGAA GACCTTATTC GATATCTTGA ACCAGAGAGA TGGCAGTXGG ACTTAGAAGA 
ATGGAACCXT CXGCAATAAG CTATAGAACT TGGTCTCTCT ACCGTCAACC TGAATCTTCT 

+2 pLeuTyrArg ProThxTrpGln LeuLcuGly LysAlaPhe ValPheGlyArg LysScrAr 



901 TCTATA?AGG CCAACTTGGC AACTTCTTGG CAAGGCTTTT GTTTTTGGAA GAAAATCCAG 
AGATATATCC GGTTGAACCG TTGAAGAACC GTTCCGAAAA CAAAAACCTT CTTTTAGGTC 

+2 gValValAsp LcuAsnLeuLeu ThrGluGlu ValArgLeu TyrSerCysThr ProArgAs 



961 AGTGGTGGAT CTGAACCTTC TAACAGAGGA GGTAAGATTA TACAGCTGCA CACCTCGTAA 
TCACCACCTA GACTTGGAAG ATTGTCTCCT CCATTCTAAT ATGTCGACGT GTGGAGCATT 

+2 nPheSerVal SerlleArgGlu GluLeuLys ArgThrAsp ThrllePheTrp ProGlyCy 



1021 CTTCTCAGTG TCCATAAGGG AAGAACTAAA GAGAACCGAT ACCATTTTCT GGCCAGGTTG 
GAAGAGICAC AGGTATTCCC TTCTTGATTT CTCTTGGCTA TGGTAAAAGA CCGGTCCAAC 

+2 sdeuLeMVal LysArgCysGly GlyAsnCys AlaCysCys LeuKisAsnCys AsnGluCy 



10S1 XCXCCXSGTT AAACGCTGTG GTGGGAACTG TGCCTGTTGT CTCCACAATT GCAATGAATG 
AGAGGACCAA TTTGCGACAC CACCCTTGXC ACGGACAACA GAGGTGTTAA CGTTACTTAC 

+2 sGlnCysVal ProSerLysVal XhrLysLya TyrHisGlu ValLeuGlnLeu ArgProLy 



1141 TCAATGrC-TC CCAAGCAAAG TTACTAAAAA AXACCACGAG GTCCTTCAGT TGAGACCAAA 
A3TTACft*AG GGTTCGTTTC AATGATTTTT TAXGGXGCXC CAGGAAGTCA ACTCTGGTTT 

*2 sXhrGlyVal ArgGlyLeuHia LysSerLeu ThrAspVal AlaLeuGluHis HisGluGl 



1201 GACCGGTGTC AGGGGATTGC ACAAATCACT CACCGACGTG GCCCTGGAGC ACCATGAGGA 
CTGGCG&ZAG TCCCCTAACG TGTTXAGXGA GTGGCTCCAC CGGGACCTCG TGGTACTCCT 

♦2 uCysAspCys ValCysArgGly SerThrGly Gly 



1261 GTGTGAC7GT GTGTGCAGA6 GGAGCACAGG AGGATAGCCG CATCACCACC AGCAGCTCTT 
CACACTGACA CACACGTCTC CCTCGTGTCC TCCTATCGGC GTAGTGGTGG TCGTCGAGAA 

1321 GCCCAGAC-CT GTGCAGXGCA GTGGCTGATT CTATTAGAGA ACGTATGCGT TATCTCCATC 
CGGGTCTCGA CACGTCACGT CACCGACTAA GATAATCTCT TGCATACGCA ATAGAGGTAG 

13 SI CrXAATCTCA GTTGTTTGCT TCAAGGACCT TTCATCTTCA GGATTTACAG TGCATTCXGA 
GAATTAGAOT CAACAAACGA AGTTCCTGGA AASTAGAAG? CCTAAATGTC ACGTAAGACT 
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1441 AAGAGGAGAC ATCAAACAGA ATTAGGAGTT GTGCAACAGC TCTTTTGAGA GGAGGCGEAA 
TXCTCCTCTC TAGTTTGTCT TAATCCTCAA CACGTTGTCG AGAAAACTCT CCTCCGGATT 

1501 AGGACAGGAG AAAAGGTCTT CAATCGTGGA AAGAAAATTA AATGTTGTAT TAAATAGATC 
TCCTGTCCTC TTTTCCAGAA GTTAGCACCT TT CTTTTAAT TTACAACATA ATTTATCTAG 

1561 ACCAGCTAGT TTCAGAGTTA CCATGTACGT ATTCCACTAG CTGGGTTCTG TATTTCAGTT 
TGCTCGATCA AAGTCTCAAT GGTACATGCA TAAGGTGATC GACCCAAGAC ATAAAGTCAA 

1621 CTTTCGATAC GGCTTAGGGT AATGTCAGTA CAGGAAAAAA ACTGTCCAAG TGAGCACCTG 
GAAAGCTATG CCGAATCCCA TTACAGTCAT GTCCTTTTTT TGACACGTTC ACTCGTGGAC 

1631 ATTCCGTTGC CTTGGCTTAA CTCTAAAGCT CCATGTCCTG GGCCTAAAAT CCTATAAAAT 
TAAGGCAACG GAACCGAATT GAGATTTCGA GGTACAGGAC CCGGATTTTA GCATATTTTA 

1741 CTGGATTTTT TTTTTTTTTT TTGCGCATAT TCACATATGT AAACCAGAAC ATTCTATGTA 
GACCTAAAAA AAAAAAAAAA AACGCGTATA AGTGTATACA TTTGGTCTTG TAAGATACAT 

1S01 CTACAAACCT GGTTTTTAAA AAGGAACTAT GTTGCTATGA ATTAAACTTG TGTCATGCTG 
GATGTTTGGA CCAAAAATTT TTCCTTGATA CAACGATACT TAATTTGAAC ACAGTACGAC 

1861 ATACGACAGA CTGGATTTTT CATATTTCTT ATTAAAATTT CTGCCATTTA GAAGAAGAGA 
TATCCTGTCT GACCTAAAAA GTATAAAGAA TAATTTTAAA GACGGTAAAT CTTCTTCTCT 

1921 ACTACA1TCA TGGTTTGGAA GAGATAAACC TGAAAAGAAG AGTGGCCTTA TCTTCACTTT 
TGATGTAAGT ACCAAACCTT CTCTATTTGG ACTTTTCTTC TCACCGGAAT AGAAGTGAAA 

1961 ATCGATAAGT CAGTTTATTT GTTTCATTGT GTACATTTTT ATATTCTCCT TTTGACATTA 
TAGCTATTCA GTCAAATAAA CAAAGTAACA CATGTAAAAA TATAAGAGGA AAACTGTAAT 

2041 TAACTGTTGG CTTTTCTAAT CTTGTTAAAT ATATCTATTT TTACCAAAGG TATTTAATAT 
ATT G A CAA CC GAAAAGATTA GAACAATTTA TATAGATAAA AATGGTTTCC ATAAATTATA 

2101 TCTTTT7TAT GACAACTTAG ATCAACTATT TTTAGCTTGG TAAATTTTTC TAAACACAAT 
AGAAAAAATA CTGTTGAATC TAGTTGATAA AAATCGAACC ATTTAAAAAG ATTTGTGTTA 

2161 TGTTATAC-CC AGAGGAACAA AGATGATATA AAATATTGTT GCTCTGACAA AAATACATGT 
ACAATATCGG TCTCCTTGTT TCTACTATAT TTTATAACAA CGAGACTGTT TTTATGTACA 

2221 ATTTCATTCT CGTATGGTGC TAGAGTTAGA TTAATCTGCA TTTTAAAAAA CTGAATTGGA 
TAAAGTAAGA GCATACCACG ATCTCAATCT AATTAGAOGT AAAATTTTTT GACTTAACCT 

2281 ATAGAATTGG TAAGTTGCAA AGACTTTTTG AAAATAATTA AATTATCATA TCTTCCATTC 
TATCTTAACC ATTCAACGTT TCTGAAAAAC TTTTATTAAT TTAATAGTAT AGAAGGTAAG 

2341 CTGTTATTGG AGATGAAAAT AAAAAGCAAC TTATGAAAGT AGACATTCAG ATCCAGCCAT 
GACAATAACC TCTACTTTTA TTTTTCGTTG AATACTTTCA TCTGTAAGTC TAGGTCGGTA 

2401 TACTAACC7A TTCCTTTTTT GGGGAAATCT GAGCCTAGCT CAGAAAAACA TAAAGCACCT 
ATGATTGGAT AAGGAAAAAA CCCCTTTAGA CTCGGATCGA GTCTTTTTGT ATTTCGTGGA 

2461 TGAAAAASAC TTGGCAGCTT CCTGATAAAG CGTGCTGTGC TGTGCAGTAG GAACACATCC 
ACTTTTTCTG AACCGTCGAA GGACTATTTC GCACGACACG ACACGTCATC CTTGTGTAGG 

2521 TATTTATTGT GATGTTGTGG TTTTATTATC TTAAACTCTG TTCCATACAC TTGTATAAAT 
AT AAATAA CA CTACAACACC AAAATAATAG AATTTGAGAC AAGGTATGTG AACATATTTA 
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2581 ACATGGATAT TTTTATGTAC AGAAGTATGT CTCTTAACCA GTTCACTTAT TGTACTCTGG 
TGTACCTATA AAAATACATG TCTTCATACA GAGAATTGGT CAAGTGAATA ACATGAGACC 

2641 CAATTTAAAA GAAAATCAGT AAAATATTTT GCTTGTAAAA TGCTTAATAT CGTGCCTAGG 
GTTAAATTTT CTTTTAGTCA TTTTATAAAA CGAACATTTT ACGAATTATA GCACGGATCC 

2701 TTATGTGGTG ACTATTTGAA TCAAAAATGT ATTGAATCAT CAAATAAAAG AATGTGGCTA 
AATACACCAC TGATAAACTT AGTTTTTACA TAACTTAGTA GTTTATTTTC TTACACCGAT 

2761 TTTXGGGGAG AAAATT 
AAAACCCCTC TOTTAA 



y j 
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Figure 8. Additional oligonucleotides used for amplification of entire ^ 
coding region 

5'-1 TTTGTTTAMCCTTGGGAAACTGG 
S'-2 GTCCAGGTTTT6CTTTGATCC 
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Figure 9. DNA Sequence Of Clones 4 & 7, Identical Clones Containing The 
Entire Open Reading Frame 



1 TTTGTTTAAA CCTTGGGAAA CTGGTTCAGG TCCAGGTTTT GCTTTGATCC TTTTCAAAAA 
AAACAAATTT GGAACCCTTT GACCAAGTCC AGGTCCAAAA CGAAACTAGG AAAAGTTTTT 

61 CTGGAGACAC AGAAGAGGGC TCTAGGAAAA AGTTTTGGAT GGGATTATGT GGAAACTACC 
GACCTCTGTG TCTTCTCCCG AGATCCTTTT TCAAAACCXA CCCTAATACA CCTTTGATGG 

121 CTGCGATTCT CTGCTGCCAG AGCAGGCTCG GCGCTTCCAC CCCAGTGCAG CCTTCCCCTG 
GACGCTAAGA GACGACGGTC TCGTCCGAGC CGCGAAGGTG GGGTCACGTC GGAAGGGCAC 

131 GCGGTGGTGA AAGAGACTCG GGAGTCGCTG CTTCCAAAGT GCCCGCCGTG AGTGAGCTCT 
CGCCACCACT TTCTCTGAGC CCTCAGCGAC GAAGGTTTCA CGGGCGGCAC TCACTCGAGA 

+2 MetSer LauPheGly LcuLouLou LeuThrSerAla LquAIaGI 

] 

241 CACCCCAGTC AGCCAAATGA GCCTCTTCGG GCTTCTCCTG CTGACATCTG CCCTGGCCGC 
GTGGGGTCAG TCGGTTTACT CGGAGAAGCC CGAAGAGGAC GACTGTAGAC GGGACCGGCC 

+2 yGlnArgGln GlyThrGlnAla GluSarAsn LeuSerSer LysPheGlnPha SarSerAs 



301 CCAGAGACAG GGCACTCAGG CGGAATCCAA CCTGAGTAGT AAATTCCAGT TTTCCAGCAA 
GGTCTCTGTC CCCTGAGTCC GCCTTAGGTT GG ACT CAT CA TTTAAGGTCA AAAGGTCGTT 

+2 nLysGluGln AsnGlyValGln AspProGln HisGluArg IlelleThrVal SerThrAs 



361 CAAGGAACAG AACGGAGTAC AAGATCCTCA GCATGAGAGA ATTATTACTG TGTCTACTAA 
GTTCCTTGTC TTGCCTCATG TTCTAGGAGT CGTACTCTCT TAATAATGAC ACAGATGATT 

+2 nGlySerlle HisSerProArg PheProHis ThrTyrPro ArgAsnThrVal LeuValTr 



421 TGGAAGTATT CACAGCCCAA GGTTTCCTCA TACTTATCCA AGAAATACGG TCTTGGTATG 
ACCTTCATAA GTGTCGGGTT CCAAAGGAGT ATGAATAGGT TCTTTATGCC AGAACCATAC 

+2 pArgLeuVal AlaValGluGlu AsnValTrp IleGlnteu ThrPheAspGlu ArgPheGi 



431 GAGATTAGTA GCAGTAGAGG AAAATGTATG GATACAACTT ACGTTTGATG AAAGATTTGG 
CTCTAATCAT CGTCATCTCC TTTTACATAC CTATGTTGAA TGCAAACTAC TTTCTAAACC 

+2 yLeuGluAsp ProCluAspAsp IleCysLys TyrAspPhe ValGluValGlu GluProSe 



541 GCTTGAAGAC CCAGAAGATG ACATATGCAA GTATGATTTT GTAGAAGTTG AGGAACCCAG 
CGAACITCTG GGTCTTCTAC TGTATACGTT CATACTAAAA CATCTTCAAC TCCTTGGGTC 

+2 rAspGl/Thr IleLeuGlyArg TrpCysGly SerClyThr ValProGlyLys GlnIl«Se 



601 TGATCCAACT ATATTAGCGC GCTGGTGTGG TTCTGGTACT GTACCAGGAA AACAGATTTC 
ACTACCTTGA TATAATCCCG CGACCACACC AAGACCATGA CATGGTCCTC TTGTCTAAAG 

+2 rLysGlyAsn GlnlleArglle ArgPheVal SerAspGlu TyrPheProSer GIuProGl 



661 TAAAGGAAAT CAAATTAGGA TAAGATTTGT ATCTGATGAA TATTTTCCTT C7GAACCAGG 
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ATTTCCTTTA GTTTAATCCT ATTCTAAACA TAGACTACTT ATAAAAGGAA GACTTGGXCC 
+2 yPheCys!l« HisTyrAsnlle ValMetPro GlnPheThr GluAlaValSsr ProSerVa 



72 X OTTCTGCATC CACXACAACA TTGTCATGCC ACAATTCACA GAAGCTGTGA GTCCTTCAGT 
CAAGACG7AG GTGATGTTGT AACAGTACGG TGTTAAGTGT CTTCGACACT CAGGAAQTCA 

+2 lLeuPrcPro SerAla&euPro SeuAspLeu leuAsnAsn AlalleThrAla PheSQrTh 



781 GCTACCCCCT TCAGCTTTGC CACTGGACCT GCTTAATAAT GCTATAACTG CCTTTAGTAC 
CGATGGSGGA AGTCGAAACG GTGACCTGGA CGAATTATTA CGATATTGAC GGAAATCATG 

♦2 rLeuGluAs? LeuIleArgTyr LeuGluPro GluArgTrp GlnLeuAspLeu GluAcpLe 



841 CTTGGAA3AC CTTATTCGAT ATCTTGAACC AGAGAGATGG CAGTTGGACT TAGAAGATCT 
GAACCTTCTG GAATAAGCTA TAGAACTTGG TCTCTCTACC GTCAACCTGA ATCTTCTAGA 

+2 uTyrArg?ro ThrTrpGlnLeu LeuGlyLys AlaPheVal PheGlyArgLys SarArgVa 



901 ATATAGGCCA ACTTGGCAAC TTCTTGGCAA GG CTTTTGTT TTTGGAA6AA AATCCAGAGT 
TATATCCGGT TGAACCGTTG AAGAACCGTT CCGAAAACAA AAACCTTCTT TTAGGTCTCA 

+2 iValAspLeu AsnLeuLeuThr GluGluVal ArgLeuTyr SerCyeThrPro ArgAsnPh 



961 GGTGGATCTG AACCTTCTAA CAGAGGAGGT AAGATTATAC AGCTGCACAC CTCGTAACTT 
CCACCTAGAC TTGGAAGATT GTCTCCTCCA TTCTAATATG TCGACGTGTG GAGCATTGAA 

+2 eSerValSer IleArgGluGlu LeuLysArg ThrAspThr IlePheTrpPro GlyCysLa 



1021 CTCAGTGTCC ATAAGGGAAG AACTAAAGAG AACCGATACC ATTTTCTGGC CAGGTTGTCT 
GAGTCACA3G TATTCCCTTC TTGATTTCTC TTGGCTATGG TAAAAGACCG GTCCAACAGA 

♦ 2 uLcuVallys ArgCysGlyGly AsnCysAla CysCysLeu HisAsnCysAsn GluCysGl 



1031 CCTGGTSAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA ATGAATGTCA 
GGACCAATTT GCGACACCAC CCTTGACACG GACAACAGAG GTGTTAACGT TACTTACAGT 

+2 nCyeValrro SerLysValThr LyeLysTyr HisGluVal L«uGltiLeuArg ProLysTh 



1141 ATGTGTCCCA AGCAAAGTTA CTAAAAAATA CCACGAGGTC CTTCAGTTGA GACCAAAGAC 
TACACACGGT TCGTTTCAAT GATTTTTTAT GGTGCTCCAG GAAGTCAACT CTGGTTTCTG 

+2 rGlyValArg GlyLeuKisLys SerLeuThr AspValAla LeuGluHisHis GluGluCy 



1201 CGGTGTCAGG GGATTGCACA AATGACTCAC CGACGTGGCC CTGGAGCACC ATGAGGAGTG 
GCCACAGICC CCTAACGTGT TTAGTGAGTG GCTGCACCGG GACCTCGTGC TACTCCTCAC 

+ 2 sAspCysYal CysArgGlySer ThrGlyGly 



12 61 TGACTGTG-G TGCAGAGGGA GCACAGGAGG ATAGCCGCAT CACCACCAGC AGCTCTTGCC 
ACTGACACAC ACGTCTCCCT CGTGTCCTCC TATCGGCGTA GTGGTGGTCG TCGAGAACGG 

1321 CAGAGCTG7G CAGTGCAGTG GCTGATTCTA TTAGAGAACG TATGCGTTAT CTCCATCCTT 
GTCTCGACAr GTCACGTCAC CGACTAAGAT AATCTCT7GC ATACGCAATA GAGGTAGGAA 

1381 AATCTCAS:-? GTTTGCMCA AGGACCTTTC ATCTTCAGGA TTTACAGTGC ATTCTGAAAG 
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TTAGAGTCAA CAAACGAAGT TCCTGGAAAG TAGAAGTCCT AAATGTCACG TAAGACTTTC 

AGGAGACATC AAACAGAATT AGGAGTTGTG CAA 
TCCTCTGTAG TTTGTCTTAA TCCTCAACAC GTT 
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Figure 10. Predicted Full-length Polypeptide Sequence 

1 MSLFGLLLLT SALAGQRQGT QAESNLSSKF QFSSNKEQYG VQDPQHERII 

51 TVSTNGSIHS PRFPHTYPRW TVLVWRLVAV EENVWIQLTF DERFGLEDPE 

101 DDICKY^rVE VEEPSDGTIL GRWCGSGTVP GXOISXGNQI RIRFVSDEYF 

151 PSEPG7CIHY NXVMPQFTEA VSPSVLPPSX LPLDLLNNAI TAFSTLEDLI 

201 RYLEPSaWQL DLEDLYRPTW QLLGKAFVFG RXSRWDLW- LTEEVRLYSC 

251 TPRNFSVSIR EELKRTDTIF WPGCLLVKRC GGNCACCLHN CNECQCVPSX 

301 VTKKYHZVLQ LRPXTGVRGL HKSLTDVALE HHEECDCVCR GSTGG 
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Figure 11 



Alignment of VEGF-X with Other VEGFs 
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Figure 12. Variant Polypeptide Sequences 
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Figure 1 3. Primers for Expression of VEGF-X 

E.coli expression of domain- 

vegx-6 AATTGGATCCGAGAGTGGTGGATCTGAACC 

vegx-7 MTTGGATCCGGGAAGAAAATCCAGAGTGG 

vegx-8 GGTTG AATTCATTA I (ill I AGTAACTTTGCTTGGGACAC 

vegX-9 AATTGAATTCATTATCCTCCTGTGCTCCCTC 

Baculovirus/insect cell expression of full-length protein- 
vegbarf 

AATTGGATCC6GAGTCTCACCATCACCACCATCATGAATCCAACCTGAGTAGTAAATTC 
C 

vegbac2 AATTGAATTCGCTATCCTCCTGTGCTCCCTCTGC 
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1 >3993180H1 LUNGNON03 INCYTE 

2 cacaaatcactcaccgacgtcgccctgoagcaccatgaggngtgtgactctgtgtccagagggagcacacoaggatagcc 

3 gcatcaccaccagcagctcttgcccagagctgtgc&gtgcagtggctgattcta7tagagaacgt 

4 ccttaatctcagrcgtttc^ttcaaggacctttcatcttcaggat^ 

5 aattagoasttgtgcaac ag c ! x ^trrr o agaggaggctaaaccacaggagaaffacctctt ^ 

6 >3 510192h1 cohcnct01 incyte 

7 tccagtcca^tcgctgattctattagagaacgtatgcgttatctccatccttaatctcaow 
9 tcatcttcaggatttacactccattctoaaagaggagacatcaaacag 

9 gagmctaaaggacaggagaaaaggtcttcaatcgtcc^^ 

10 tcagacttaccatgtacgtattccactacctcggttctgtaw 

11 >2S59870H1 ADRETUTOi XNCYTS 

12 CACGAGGTCC^rrcAGTWAGACCAAAGACCGGT^^ 

13 TGAGGAGTGTGACTGTOTGTGCAGAGGGAGCACAGGGGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGC 

14 AGTKAGTCGCTGATTCTATTAGAGAACGTATGCGWATCTC 
13 TCTTCAGGATTTACAGTCCATTCTGAAAGAGGAGA 

1$ >397976?K1 LUNGTUT08 INCYTE 

17 GGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGtGGCTGATTCTATTAGAGAACGTATGC 

18 GTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCT 

19 ACATCAAACAGAATTAGGAGTTG7GCAACAGCTCTTTTGAGAGGAGGCCTAAAGCACACGACAAAACGTCCTCAATCGTO 

20 GAAAGAANAT7AAATC7TCTATTAAATAGACACCAGCT 

21 >39B0011H1 Z.U7TGTUT08 ItfCYTE 

2 2 GGAGGATAGCCGC ATC ACCACCAGCAGC TCTTGCCCAGAGCTGTGGAGTGCAGTCGCTGATTCTATTAGAGAACGTATGC 

23 GrTATCTC«TCCTTAATCTCAG?TGCrTGCTTCAAGGACCTTTCA^ 

24 CATCAAACACAATTAGGAGTTGIGCAACAGCTCTm^ 

25 AAAGAAAATTAAATOTTGTATTAAATAGATCACCA 

26 >4B25396K1 BLAOTIT01 INCYTi: 

2 7 GAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAAC 

2 9 GCAATCAATGTCAATGTGTCCCAAGCAAAOTTACTAAAAAATACCACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTC 

2 9 AGGCCATTCCACAAATCACTCACCGACGTGGCCCTGCAGCACCATGAGGAGTG7GACTGTGTC7GCAGACGGACCACAGC 

30 AGGATAGCCGCATCACCACCA 

31 >3 073703)11 BOHEWTC1 INCYTE 

3 2 AGAAAATCCAGAGTGGTGGATCTGAACCTTCTAACAGAQ GAGGTAACATTATACACCTCCACACCTCCTAACTTCTCAGT 

3 3 GTCCATAAGGGAAGAACTAAAGASAACCGATACCATTTTCTGGCCAGGtTGTCTCCTGGCTA 

34 GTGCCTGT?GTCTCCACAATTGCAATSAA7GTCAATGTGTCCCAAGCAAAGTTAC^^ 

35 TTGAGACCAAAGACCGGTGTCAGGC-C-ATTCCACAAATCA 
35 >1302S16H1 PMCNCTC2 INCYTE 

37 ACGAAATCAAATTAGGATAAGATTTG-T ATC TG ATGAATATTTTCCTTCTGAACCTTC TAACAGAGGAGGTAAGATTATAC 
3 8 ACCTGCACACCTCGTAACTTCTCAGTG W 

3 9 CCTGGTTAAACGCTGTGGTGGGWCTGTGCCTGTTGTC^ 

40 ACTAAAAAATACCACGAGGTCC 

41 >3684109H1 H2AANCT01 INCYTE 

4 2 ATTTCATCTTCAGGATTTACAG7GC A7TCTSAAANACGACAAATCAAAC ANAATTAGGAGTTGTGCAACAGCTCTTTTSA 
4 3 GAGGAGGCCTAAACGAC AGGAGAAAAGGTCTTCAATCGTGGAAANAAAATTAAATGTTGTATTAAAM 

4 4 GTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATWCAGTTCTTTCGATACGGCTTAGGGTAATGTCAG 

45 TACAGCAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTT 

46 >471318SK1 B3UIHCTC1 INCYTE 

4 7 CAAACTTACTAAAAAATACCAC SA3GTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCACTCACCG 
4 8 ACGTGGCCCTGGAGCACCATGAGGAS7G7GACTGTGTGTGCAGAGGGAGCACAGGAGGATAGCCGCATCACCACCAGCAG 

4 9 CTCTTGCCCAGAGCTGTGCAGTGCAGTGGCKATTCTATTAGAGAACGTATGCGTTATCTCCATCCtTAATCTCAGTrGX 

50 TTGCT 

51 >4588a3Kl XERANCTC1 INCYTE 

52 ANGAGTTGCCCAGAGCTGTGCAG7GCAGTGGCTGATTCTATTAGAGAACGTATGCGTTA7CTCCATCCTTMTCTCAGTT 

5 3 GT7TGNTTCAAGGACCTT7CATC ttc aggatttacag TGCATTC7GAAAGAGGAGACATC AAACAGAATTAGGAGTTGTG 

54 CAACACCTCTTTTGAGACGAGCCCTAAACGNCACCACAAAAGGTCT^ 

55 ATAGATC 

56 >1303909H1 PLACSOT02 INCYTE 

57 AGGAAATCAAATTACCATAAGATTTGTATCTGATGJUTAWTTC^ 

58 AGCTGCACACCTCGTAACTTCTCAG7GTCCATAAGGCAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCT 

5 9 CCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCC AC AATTGCAATGAATGTC A.».TGTGTCCCAAG 

60 >2739211H1 OVARNOT09 INCVTS 

61 GTGCATTCTGAAAGAGGA3ACATCAAACAGAA?TAGGAGTTCTCC^ 

62 GAAAAGGTC1TCAATCGTGGAAAGAAAATTAAATGTTGTATTAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACG 

63 7ATTCCACTAGCTGGGTTCTCTA7TTCAGTTCTCTCGATACGTCTTAGGGTAATC7CA 

64 GTGACCACCTGAT 

65 >332S531H1 3>THYNOTC3 INCYTE 

S 6 7GCAACAGCTCTTTTCAGACGAGCCCTAAAGGACAGCACAAAAGCTCTTCAATCGTGGAAAC 

67 AAATAGATCACCAGC7AGTT7CAC-AGTTACCATG7ACGTATTCCAC7AGCTGGGTTCTGTATTTCAGTTCT7TCGATACG 

6 8 GCTTAGGGTAATGTC AGTACAGGj^\A.VAACTGTGCAAG7GAGCACC7GAI7CCGnGCCt7GCT7AACCCTAAAGCWCC 
69 ATGTCNNGGGOfAAAANCGAAAAAT 

10 >3733 5$5H1 SMCCNOSC1 INCYTE 

1 1 CCTTAA7CTCAGT7G7TTGCTTCA^GGACC7T7CATCTTCAGGATTTACAGTGCATTCTGKAAGANGAGACATCAAACAG 
1 2 AATTAGGKGTTGTGCAAAAGCC'CCTCTCAGAGGACOCCTAAAGCACAGCAGAAAACCTCTKCAATCGTGGAAAGWAAATT 
7 3 AAATCTTCTATNAAATNGATCACCAGC t AG7TTCAGA3TTACCATC7ACGTATTCCAC7AGC7GGGNCNGTATTCAGTCT 

7 4 TTCGGAACGOC7TAGOGT AATGTCAG TACAOGAWAAAAACTGTGC AGTGA3 

75 >3 55<J223Hl SYNOKGTSl INCYTE p ^ (Lf. 
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7*6 XTTAAATAGATCACCAGCTJySTTTCAGAG^ 
77 ACpQCCTAGGGTMTCTCAGTACAGCAAAAAAACWTCCAAGT^ 
' 78 CTCCATOTCCTKttCCTAJUUttCGrAT^ 

79 ACATTCTATOTACKACAAMCTCGrTTTTAAAAAGGAAC 

80 >iS07477Bl OVORTDTOl XHCYTB 

81 ©CCtACOTTCAGACTWU:CA«^^ 

82 OTCAGTACAGGAAAAAAACTOTGCAAG^ 

83 TAAAATCGTATAAAATCTOOA 

84 >41*3378H1 HRSTMOT32 INCTTfi 
89 AATAOATCACCAOCrAGTTTCA^^ 

86 GCTrA0a€TA*TG7eAGtt!£AGGA^ 

87 ATGTCCTOCCCCTAAAATeCTATA 



Express Mail Label No. 

EL024661737US 

Date of Deposit: 12/21/99 



1 >2054S75H1 BffPINOTOl INCYTE 

2 AAAGGAAC?AWTTGCTATGAArTAAAC TTGTGTCGTCCTGATAGGACAGACTGGATTTTTCATATTTCTrATTAAAATT 

3 TCTGCCATraAGAAGAAQAGAAOTACATTCArGGTm^ 

4 TATCCATAAGTCAGTTrATTTGTTTCATTCTOTACATT^ 

5 TCTTGTTAAATATATCTAWriTACCAAAGGTATlT'AAlATTCTTTTTTA 

6 >3993l80Hl INCITE ^ 
? CACAAATCACTCACC^CCTCGCCCTGGACCAC^ 

8 CCATCACCACCAGCACCTCriWCCAGAGCTGTCC^ 

9 CCTTAATCTCAGTTGTTTGCrTCAAGGACCTTTCATCtTCAGGATTTOCAGTCCATTC 

10 AATTACClAGTTGTGCAACAGCTCTtTtCAGAGGAGGCTAAA^K^CAGCACAAKAGCTCTT 

11 >3510192H1 CONCKOTO: 1NCYTE 

1 2 TGCACTGCACTCCCTGATTCTAI7A^AGAACCTAT(^G?TAl^TCCATCCTTMTCTCAaTTGTTTCCTTCAACGACCW 

13 TCATCTTCAGGATTTACAGTGCATTCTCAAAGAGCMACATC^^ 

14 GAGCCCTAAAGGACAG£AGAAAAGG7CTTCAATCGTO3^^ 

15 TCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTC TGTATTT 

16 >4164633H1 BRSTNOT32 INCYT2 
IT CrrGTTAAATATATCTATOIWACCAAACGTA^ 

18 GTAAATTTTTCTAAACACAATTCTTATAGCCA«G<iAACAAACATGATATAAAATATTG 

19 TAWTCATTCTCCTATGGTGCTACACMACAT^^ 

2 0 AAGACTTTTTGAyAATAATTAMrTATCATATCTTCCATTCCTGTTATTGGGGOAGAAAAT 

21 >2559870H1 ADRETOT01 XNCYTE 

22 CACGAGGTCCT7CAGTTGAGACCAAAGACCGGTGTCAGGGGATl^CACAAATCACTC\CCGACGTGGCCCTG0ACCACtt 

23 TGACGAGTCTGACTGTCTGTOCAGACGGACCACAG^ 

24 AGTCCAGTGGCTGATTCTATTAGAGAACGTATGCGrTATCTCCATCCTTAATCTCAG^^ 

25 TCTTCAGGATTTACA GTGCATTC TGAAAGAGGAGA 

26 >3917470K1 SONSTUTC1 IKCVTE 

2 7 TTAAAAACCAACTATGTTCCTATGAATTAAACTTGTGTCATGCTGATAGGACA^ 

2 8 AATTTCTCCCATTTAGAAGAAGASAACTACATTCATGGT^ 

29 ACTTTATCGATAAGTCACCTTATTTCTrCCATTGT^ 

30 TAATCfOTTAAATATATCTAttTTTACCAAAGGTATXTAATATTCTXT 

31 >3979767H1 tONGWTO* IWCVT3 

3 2 <^AGGATAGCCGGATCACCACCAGCAGCTCT7GCCCAGAGCrGTGCAGTGCAG7GGC7GA7TCTATrA.GAGAACGTAT^ 
3 3 GTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATrTTC 

3 4 ACATCAAACAGAATTAGGAGTTGTCCAACAGCTCTTTTGA^ 

35 GAAAGAANATTAAATGTTGTATTAAATAGACACCACCT 

36 >3980C11H1 LONGT'JTOS IRCYTE 

3 7 GGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTC^AGTGCAGTGGCTGATTCTATrAGAGAACGTArGC 
3 S GTTAT CTCCATCC7TAATCTCAGTTG T T TG C^C AAGG ACCTTTCATCTTCAGG ATTTACA ?G C ATTCTGAAAGAGGAGA 

3 9 CATCAAACAGAATTAGCAGTTGTGCAACAGCTCCTTTGAQAGGAGCCCT^ 

40 AAACAAAATT3AAPGTTGTA7TAAATASATCACCA 

41 >4925396H1 BtADOITCl INCITE 

42 GAGAACCGATACCATTTTCTGGCCACGTX^TCTCCTGGTTAAACGCTGTGGTGGGAACTGTC 

43 GCAATGAATGTCAAXGTGTCCCAAGCAAAGTTAC^AAAAAATACCACGAGGTCCTTCAGITGAGACCAAAGACCGGTGTC 

44 AGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGAGXGTGACTCTGTGTGCAGAGGGAGCACAGG 

45 AGCATAGCCGC ATCACC ACCA 

46 >3073703H1 BONEUNTCi IHCTfTS 

4 7 ACAAAATCCAGAGTCGCGGATCTGAACCTTCTAACAGAQGAGOTAAGATTATACAGCTGCACACCTCGTAACTTCTCAGT 
4 8 GTCCATAACGGAAGAACTAAAGAGAACCGAIACCATraTCTGGCCAGGTTGT^ 

4 9 GTCCCTGMGTCTCCACAATCGCAA7G AATGTCAA^^ 

50 TTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCA 

51 >952169K1 BSUITCTC3 INCYTE 

5 2 AGATGATATAAAATATTGTTGCTCTG AC AAAAATACATGTAT?XCATTCTCGTATGGTGCT AGAGTTAGATTAATCTCCA 

53 TTTrAAAAAACTGAATTCCAATACAATTGGTAAGTTGCAAAGACTTTT^ 

5 4 CTCTTATTGGAGATGAAAATAAAAAGC AACTTATGAAAGTAGACATTCAGATCCAGCCAXTACTAACCTATTCCT TTTTT 

55 GGGGAAATCtGAGCCTAC-C 

56 >420138SR1 SAAITUT29 IKCVTE 

. 57 TTrcTAAAAAGGAACTATGTTGCTATCAATTAAACTTCTGTC 
58 TAAAATTTCTCCCATTTAGAAGAAGAGAACTACAnCATGvT 

5 9 TCACTTTATCGATAAGTCAGTTTATTTGTrcCAT^ 

60 CTAATCTGTTAAATATATCTATTTTTACCAAAGGTATT7AA-TAT 

61 M302S16H1 PLACKOTC2 IflcrrE 

62 AGGAAATCAAATTAGGA7AAG A7TTGCA 7CTGAT0 AATATTTTCCTTCTGAACC rTCTAACAG AGGAGGTAAGATTATAC 

6 3 AGCTGCACACCTCGTAACTTCTCAGTG TCCArAAGGGAAGAACTAAACAGAACCGATACCATTTTCTGGCCAGGTTGTCT 
6 4 CCTGGTTAAACGCTGTCGTGCGAACTG TGCCTGmTCTCCCACAATTGCAAlNSAATGTCAATGTGTCCCAAGCAAAGTT 

65 ACTAAAAAATAC CACGAGCTCC 

66 >3684109K1 HEAA.N07C1 INCTTE 

67 ATTTCATCITCAGGATTTACAC-TSCArTCTGAAAWAGGAGAAXTCAAAC^AATTAGGAGTO 

6 8 GACCAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAAXAAAATTA.^ATGT7GTAT?AAATAGATCACCAGCTA 
69 GTTTCAGAGTTACCATGtACGTATrCCACTAGCTGGGTTCTCTATTTCACTTCTrTCGATACGGCTTACGGTAATGTCAG 

7 0 TACAGGAAAAAAACTGTGCAA37GACCACCTGATTCCGTXGCCTTGCTT 

71 >2549720H1 tWCTUT;6 INCITE 

7 2 «AGCTTCGNAXA?TT?CC*AAA:Ae 

7 3 AATACA TGTATTTCA T7CTCCT A"GG TGCT AG AGTTAG A?TAATCTGCA?TC^AAAAAACTGAATTGG AA7AGAA7TGGT 
7 4 AAGT TGC AAAG AC TT TT TGAAAA 7 AAT 7AAAT TATC AT AT CTTC C ATTC CTG TTATTCG AC AT3AAAATAAAAAGC AACT 



75 TATCANAGTAC 
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76 >877279H1 tWGASTOl INCYM 

7 7 CTTTTCTATCACAACrTAGATCAACTATTTTlA^ 

7 B GATGATATAAAATATTGTTGCTCTGACAAAAA7ACAKTATTrc^^ 

79 mAAAAMCTCAATTGGAATAGAATTGGTAAGTTCCAAAGOCTTrTTGA^ 

80 TGTTATTGGttGG 

81 >4713188H1 BSAIXCTOl TUCYTS ^ 

82 CAAACCTACTAAAAAATACCACCA3GTCCTTCACSTO 

83 ACGTGOCCCTGGAGCACCATGAGGAGTGIGACTGTGTGTGCAGAGG^ 

8 i CTCTTGCCCAOAGCTGTGCAGTGCAGTCjCTGATTCTATTAGAG^ 
83 TTGCT 

86 >2171083H1 BNDOTOTC3 INCYTE 

87 ACATAAACCTGAAAAGAACAGTGGCCTTATCWCACTTTATCGATAAG^CAGTTTATXTGTTTCATTGTGTACATTTTOA 

88 TATTCTCCTTTTGACATTATAACTGTTOGCTTTIW^ 

89 C7TTTMATGACAACTCAGATCAACrATTT*rTAGCTT<OTAAATT 

90 CATGA 

91 >8758S0H1 LUHGAS70L INCYTS 
$2 CTWATTCTTCATATTTCTTATTAAAATTT^ 

93 TGAAAAGAAGA5TCGCCTTATCTICACTTTATCGATAAGTCAGTTTATTTGTTTC 

9 4 TTTGACATTATAACTGTTGGCTT7TCTAATCTTGTTAAATATATCTATTTT7 

95 GAG 

96 >7C6268H1 SYN0KA7Q4 INCYTE 

97 GCTCATAWCACA7ATGTAAACCAGAACATTCTAT0TACTACAXACCTGGTTTX7AAAAAGGANCTATGTTGCTATCAAT 

98 TAAACTTGTCTCGTGCTGATAG-oACAGACTGGATTmCATATCT^ 
9 9 TACATWTGQTTTGGAAGAGATJ^CCTGAAAASAAGAGTGGCC^ 

100 TTCA 

101 >453223H1 X£FAXOTCi IUCYTB 

1 02 ANGAGTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACQTATX^GTTATCTCCATCCTTAATCTCAGTT 

103 GTTTGOTTCAAGGACCTr:CATCTrCAGGATTTACAGTGCATTCTGAAAGAGGAGACA« 

1 04 CAAC^GCTCTTTTGAGACGAGGCCTAAAGGNCAGGAGAA^^ 

105 ATAGATC 

106 >53B436H1 LNCDKOT02 IJ7CYTE 

107 AMCATMTATAAAATATTOTTGCTCTCACAAAAATACATGTATTTCATTCTTO 

108 CATTTTAAAAAACTGAATTGGAATASAATTGGTMGTTCCAAAGACTn^ 

109 TCCTCTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGTAGACATTCAGATCCAGCCATTA 

110 >1303909K1 FLACNOTC2 INCYTE 

111 AGGAAATCAAAMACGATAAGArrrOTATCTGATOMTATr^ 

112 AGCTC CACACCTCGTAACTTCTCAGTt?TCCATAAGGGAAGAACTAAAGAGAACCGATACCA75 l TTCTCGCCAGGTTGTCT 

1 13 CCTGGTTAAACGCTGTGGTGGGAAC'TG'rGCCTGTTGrCTCC ACAATTCCAAO?GAATGTCAATGTG'TCCCAnG 

114 >2739211H1 OVJULVOTC9 ItfCYTS 

115 GTCaTATTCTGAAAGAGGAGACATCAAACAGAAT^^^ 

116 CAAAAGGTCTT(^TCC7CX?AAA<i^^ 

1 17 TATXCCACTAGCTGGGTTCTGTATrrCAGTTCTTTCGATaCGGC'TTAGGGTAATGTCAGTACAGGAAAAAAACTCTGCAA 

118 GTGACCACCTGAT 

119 >255C343H1 LUMGTCTCa INCVT3 

120 T5TACACTTTTATATTCTCCT?T?GACATTA£AACTCTTGOCT^ 

121 GTATTTAATAT7CT W;?ATGACAACT7AGATCAACT AT7TTTACXTTCGTAAATTTTTCTAAAC ACAATTGrTATAGC 

1 22 CAGAGGAACAAAGATCATATAAAATATTGT7GCTCTGACAAAAATACATGTATTTCATTCTCGTATGGTGCTA 

123 >5321148H1 ?IB?F2k:£ INCYTE 

124 CACAATTGTTATAGCCAGA3GAACAAAGACGATATAAAATAT7GTTGCTCTGNCAAAAAXACATGTATTTCATTCTCGTA 

125 TGGTGCTAGAGTTAGAt TAATC 7GCA7TT7AA>JiAACTGAAr?GGAA7AGAATTGGTAAG TTGCAAAGACTT77TGAAAA 

126 TAATTAAA7TATCArATC7TCCATTCCTC7TATTGC^^ 

127 CATTACTAAC 

128 >S79495H1 THYHNC7C2 INCYTE 

129 ATTTCATTCTCGTATGG?GC?AGA3rrAGATT^^ 

130 AGAC7TTTTGAAAATAA7TAAATTATC ATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACrrATGA.VMT 

131 AGAC AT7 C AGATCC ACCC A7TAC T AACC7ATTCCTTTTTTGCC0AAATC7GAGC CT AGCTC A3AAAAACA7AAA5CACCF 

132 TCAAAAA 

132 >332S591H1 PTHYNC7C3 XWCY7E 

134 TCCAACAGCTCrTTTGA<^6GAGSCC7AA^ 

135 AAATAGATCACCAGCTAGTTTCASAG7TACCATG7ACGTATTCCACTAGC7GGGTTCT 

136 CCTTAGCGTAATGTCAGTACAGO AAAAAAACTSTGC AAGTGAGCACCTGArTCCGrrGCCTTGCTTAACCCTAAAGCNCC 

137 ATGTCrWGGCCNAAAANCGAAAAAT 

138 >543890K1 OVARiIO702 INCYTE 

139 TTTCTAAACACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCA 

140 TTCTCGTAT^GT l GCTAG^GTTAGArTAA7C7GCATTTTAAAAAAC7GAA77GGtrATAGAA7TGGTAAG7TGCAAAGNCTr 

14 1 TTTGAAAATAATTA.\\77ATCA7ATCTTCCATTCCTGTTATTGGAGGA7C^AAAA7AA 
1*2 ACATTCAGATC 

143 >3733565H1 SMCC^GSCl IWCYTE 

144 CC TTAATCTC AGTTGTTTGCTTC AAGG ACCTTTCATCTTC AGGATTTACAGTGCATTCTGriAAGANGAGACATCAAACAG 
14 5 AAT TAGCN5TTGTGCAAAAGC TCTTT7GAGAGGAGGCCTAAAGG ACAGGAGAAAAGGTC7NC AATCGTCGAAAGNAAATT 

145 AAACCTTCTATNAAA T^GATCACCAGCTASTTTC ACXCTTACCATGTACGTATTCCACTACCTGGaNCNGTATTCAGTCT 
1 4 ? T7C GG AA CCCC TTACCG7AA TG*XAG TAC AGGAN AAAAAC TG 7C C AGTG AG 

143 >464I939H1 ?RCSTM?Cj INCYTE 

14 9 GTAC7ACAAACCTGGTTT r?AAAAAGC-AACTATCT7GC7AT0AATTAAAC TTG7G "CCA TCCTG ATAGGACAGAC7GGAT 
ISO TTTNCATATTTCTTAT^AAATTTCTCCCATTTAG 
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151 GAAGAGTGCCCTTATCTTCACTmTCGATAAGTM^ 

152 ATAACGTGCCTTT 

153 >2QQ77eOHl TES7NOT03 INCYTE 

154 TTATATTCTCCmTGACATTATMCTGrTGGCTTTO^ 

155 ATTCTTTTTTATGACAACmGATCAACTAT^ 

156 AAAOXTC^TATAAAATATTCTTCCTCTGAJJAAAAATACATCTAT 

157 >3Q85331H1 KEAOTOT03 IffCYTE 

15 9 CKTCATATTCACATATGTAAACCAGAACATTCTATCTACTACAAACC^ 

155 AAACTTCTGTCCTCC TCATAGGAC AGACTCCN TT Tl ^>CATATf TC TTATTANAATTTCTGC CATTAGAAGAAC AGAACTA 

150 CATTCATGGTTTGGAAGAGATAAACCTGJUUkA^^ 

161 >3414043Hl P7HYNO704 INCYTE 

1 62 CCTCATAWCACATATCTAAA£CACAACATTCTATGTACTAC 

1 63 TAAACTTGTGTCGTGCTGATAG^CAGACTGGATTTTTCArATTTC 

164 TACATTCATGGTTTCGAAGAGATAAACCTGAAA 

165 >3705963H1 PEKCTOT07 INCYTE 

1 66 AKACTCTCCAAGTGAGCACCTGATTCCGrTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGCCTAAAATCGTATAAAA 

167 TCTGGAimnnflnn;umnnnnxuuinGCTCATATTCACATO 

160 AAAAGGAACf ATCTTGCTATGAATTAAACTTGTGTCGTGCTGATAGC 

169 TTC TGCCATTAGAAG AAGAGAACTAOTTC ANGGTTTGGAAGAGATAACCCTGAAAAC ANGCG 

170 >S137051H1 OVARDIT04 INCYTE 

171 AAAAAACTC AATTGGAATAGAA TCCG 7AAGTTGCAAAGACTNTTTCAAAATAATTAAATTATC AT ATCT7 CC ATTCCTGT 

173 AAATCTGAGCCTACCTCAGAAAAACATAAAGCACCTTGAAAAA^^ 

174 GTAGGAACACATCCTATTTATTGTGATGKTGTGGTTTATTAT 

175 >3554223H1 SWONCT01 INCYTE 

176 ATTAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTAOTTAT^ 

177 ACGGCTTAGGGTAATGTeAGTACASGAAAAAAACTCTCCAAGTG^ 

178 CTCCAJGTCCTGGGCCTAAAATCCTATAAAATCTGGA^ 

179 AC ATOCTATGTATOACAAACCTGGTTTCTAAAAAGGAAC 
18a >4507477H1 OVARTDTC1 XNCYTE 

1B1 CCCTAGTTTCAGAGMACCAC^TACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAAT 

182 GTCAGTACAGGAAAAAAACCCrcCAAGTCAGCACCTGATTCC^ 

183 TAAAATCGTATAAAATCTGCA 

184 >19SS646KI CONWOT01 INCYTE 

IBS TGGTAAGTTGCAAAGACTCTrTGAAAATAATTAAATTATCATAT^ 

186 AM^ATGAAAGTAGACATTCAGATCCAGCCATTACTAACCTATTC^ 

187 ACATAAAGCACCTTCAAAAASACTTGGCAGCTTCCTGATAAAGCGTGCTGTGCTGTGCAGTAGGGAACACATCCTATTTA 
169 TTCTCATGTTGTCCTTTATATCC7AAACC 

183 >4163378K1 BRSTOOT3 2 INCYTE 

190 AATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGNTCTGTATTTCAGTTCCTTTCGA'TACG 

191 GCTrAGGGTAATGTCAGTACAGGAAAAAAGCTCTGCAAGTGAGCACCXGAWCCGTTGCCTTGCTTAACCCT 

192 ATGTCCTCGGCCTAAAATCGTATA 

193 >5C95141H1 SPIKNON35 INCYT2 

1 94 A»TAAACCTGAAAAGAACAGTGCCCTTATHrTCACTrm 

195 TATTCTCCTTTTGAC^TTATAAC7SOTGGCTTTTCTAW^ 

196 CTTT 

197 >943 826H1 ADRJENOTCO INCYTE 

198 TATCGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAAT^^ 

199 AATAATTAAATTATCATATCTTCCAtTCCTGTTATTCCAGATGAAAATAAVAAGCAACTTAtG 

200 >345l273tfl UTRSNON03 INCYTE 

201 TTTTTTOTTTTGCTC^TAnCACATATSTAAACCNGA^ 

2 02 TTGCTATGAATTAAACTTGTGTCGTGCTGAT AGC ACAGACTCGATTTTTCAyATTyCTTAJJTAAKNTTTCTGCC ATTTAG 

204 >1402278H1 LATRTJ7 02 INCYTE 

205 GTACAGGAAAAAAACTGTCCAAG?GAGCACCIGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATG1CCTGGGCCTAAA 

206 ATCGTATAAAATC TGC Annrvnn n rur, ft r-T. nannnnnCCTCATA TTCA GATATGTAAACGAGAACATTC TATGT AC7ACAAA 

207 CCTGGTT7TTAAAAAGGAAC7A*GTTGCTATGAATTAAACTTGTGTCG7GCTGATAGCA^ 
2 OS CTTA 

209 >4361191H1 SJCIIWOT;X XNCYTE 

210 CCAAAGACTTTTTGANAAWATTAA.VTTATCATATCTTCC ATTCCTGTTATNGGAG ATGANAATAAAAAGCAACTTATGA 

211 AACTAGACATTCAGATCCAGCC A??ACTAACCTATTCC«TOTTGGGGAAATCTGAGCCTAGCNCAGAAAAACATAAAGC 

212 ACCTTCAAAMGACTTGGCAGC7TCCreATAAAGCGTGCTGTGCT0TGCACTA£MAACACATCCNATTTATTGTGNTGTS 

213 GNGGTTTTATGATC 

214 >1307017H1 PLACNOTC2 INCYTE 

21 5 rorcAGTACACCAAAAAAACTC7GCAA3TCAGCACCTOArrCCGTT^ 
21 5 CTAAAATCGTATAAAATCTGGAr.-jirjir.r.Ji2ini«^ 

217 ACAAACC7CGTTTTTAAAAAGGAACTA7GTTGCTATGAA77AAACTTGTGTCATGC7GA7AGGACAGACTGGATTTTTCA 

218 TAT 

219 >5032225H1 KEA3FE703 INCYTE 

220 AATTATCATA7C77CCATTCCTGT7A77GGAGATGKAAATAAAAAGCAACTTATGAAAG7AGACATTCAGATCCAGCCAT 

221 TACTAACCTATTCCTTTTTTGGG5AX\7CTGAGCCTAGCTCACAAAAACATAAAGCACCTTGAAAAAGACTGTCAGCTTC 
22 2 CT C ATAAAGCG TGCTGTGC TGTG C AG 7AGGAAC AC A TC CTATTT ATTGT GAT G TTC 7GCTTTT ATTATCTTAAACTCGTT 

223 CCAT 

224 >3U2621KI SKCCNOS:^ INCYTE 

225 A^7AGATGATA7AAA.VNAT7G7TGCTCTGACAANNATACATG7ATTTCATTCTCG7A7GGTGCTAGAGTTAGAT7AATCTG 
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22 iS CNTTTTAAWiAACTGAJflTGaAATAG 

227 >3530374H1 BLADHOX09 2WCYTK 

229 TTCCATTCCTGTTACTaaAOAraU^ 

229 C CmrnXjljO GAAATCTaAGCClAGCTCASAAAAJ^ 

230 TGCTOTGCTOTGCAOTAOGAAC^ 

231 TATAAXTACA^QQATATTTTCAroTAC^XAU VATCflCCTCT tAACCAGTTCA ^ 

232 >3330249H1 BLADKOT05 IRCTTS 

233 CTTCCATTCCMTTATTCOIU^TCAAAATAAAAAGCAACT^ 

234 TC Crm riW MAAATCTCACCCTACC^ 

235 GTCCTOTarTOTCCAOTAGGAACACATCCTATTTAT^ 

236 TGTATAAAffACATCOAlATTTTTAT^ACAGAAOTATG^ 
337 
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VECirEl 


AAA AT^VTATrV^ATAPA APTTAP 


11 




<TrTTrrATOAAAGATTTGGGCTTG 


23 




TTTCTAAA GG AAATCAAATTAG 


22 


VFGFE4 


GATAAGATTTGTATCTGATG 


20 


VEGFES 


GATGTCTCCTCTTTCAG 


17 


VEGFE6 


GCACAACTCCTAATTCTG 


18 


VEGFE7 


AGCACCTGATTCCGTTGC 


19 


VEGFE8 


TAGTACATAGAATGTTCTGG 


20 


VEGFE9 


AAGAGACATACTTCTGTAC 


19 


VEGFE10 


CCAGGTACAATAAGTGAACTG 


21 



4- 
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*3 KKIFLLNLLTiEVRLv 

) f 

1 AOGAMTCAA AOTAGGATAA GATTTGTAXC TCATGAATAT TTTCCTTCTC AACCTTCTAA CAGAGGAGGT AAfiAtTAfcAC 
TCCTTTAGTT TAATCCTATT CTAAACATAG ACTACTTATA AAAGGAAGAC TTGGAAGAXT GICTCCTCCA TICTAATATC 

♦3SCTP X N ? SVS IRBB L K R TUT X F W P GCi 

. — j :::::: 

81 AGCTGCACAC CtCGtAACTT CTCAGTGTCC ATAAGCGAAG AACTAAAGAG AACCGA7ACC ATXTTCXGGC CAGGTTGTCT 
TCGACGTCTG GAGCATTGAA GAOTCACAGG TATTCOCTTC TTQATTTCTC tTGGCTATGG TAAAAOACCG GTCGAACAGA 
-2 <— 

♦3 LVX R C G G tJ C A CCL HNCK E C Q C V P S K V 

161 CCTCOTTAAA CGCTGTGGTG GGAACTGTGC CXGCTSTCXC CACAAXTOCA A3CAATOTCA ATGXGTCCCA 'ACCAAAGTXA 
^ GGACCMTTT GCGACACCAC CCTTGACACC CACAACAGAG GtfGTTAACCT TACTTACAGT TACACAGGGT 7CGTT7CAA? 

+3 T K K Y KSV L Q L R 5> X T G V ft G L K X SLY DVA 

+1 VSGDCTNKSFTWF 

3 

241 CTAAAAAATA CCACGAGGIC CTTCAGTTGA GACCAAAGAC CGGTGTCAGG GGATXGCACA AATCACXCAC CGACGTGGCC 
GATPTTTTAT GGTGCICCAG GAAGTCAACT CTGGTrrCIG CCCACASTCC CCTAACGTGY TTAGTGAGTG GCTGCACCGG 
. 2 [ 

+3 L E H H EEC DCV C R C S t G G 

— . — . — > 

♦2 VQP. BHRRIAASPPAAI*A 

J 

•►1 W S X MUSV TVC A2G AQBO SRI TTS SSC 

321 CTCGAGCACC ATGAGGAGXC TGACTGTOTC TGCAGAGCGA GCACAGGAGG ATAGCCQCAT CACCACCAGC AGCXCTXGCC 
GACCTCGTCG TACTCCTCAC ACK»CACAC ACGTCTCCCT CGXOICCXCC TATCGGCGTA GTQGXGGXCG TCCAfiAACGC 

♦2 0SCA VQW Lll. LENV C V 2 SIX, KLSC L L & 

♦1PBLC S A V A D S X R E R MRY LHP 

— - — — — — > 

401 CACAGCTCTG CAGTGCAGTG QCTGATTCXA TtAGAGAACG TATGCGTTA? CTCCATCCTT AATCTCACTT GTTTGCT7CA 
GTCICGACAC GTCACC1CAC CGACTAACAT AATCTCTTGC ATACGCAATA GAGGTAGGAA TTAGAGTCAA CAAACGAAG? 

■O CPF X F R I Y S A F 

— 

431 AOGACCTTTC ATCTTCAGGA TTTACACTCC ATTCTGAAAG AGGAGACATC AAACAGA&l? AGGAGTTCTG CAACAGCTCT 
TCCTGGAAAG TACAAGTCCT AAATGTCACG TAAGACTITC TCCTCTOTAG rTTGTCXXAA TCCTCAACAC G7TGTCGAGA 

561 TTTGAGAGGA CGCCTAAAGG ACAGCAOAAA AGGTCTTCAA TCGXGGAAAG AAAATTAAAT GTTGTATTAA ATAfiMCACC 
AAACTCTCCf CCCGATTTCC TCTCCXCTCT 1CCAGAAGTX AGCACCTTTC TTTTAA1TTA CAACATAAT? TATCTAGTGG 

641 AGCTAOTTTC AGAGTTACCA XGTACGTATO CCACTAGCTG CCTCeTGTAT TTCAGTTCTT TCGATACCCC TTAGGGTAAT 
TCGATCAAAG 7CTGAA7GGT ACATGGATAA GGTCA7CGAC CCAAGACATA AAG7CAAGAA AQCTATGCCG AATCCCATTA 

721 GTCAGTACAG GAAAAAAACT GTGCAAGTGA CCACCTGATT CCGTTGCCTT CGC7TAACTC TAAAGCZCCA TGICC1GGCC 
CAGTCATGTC CTTTTTTTGA CACGTTCACT CGTGCACTAA GGCAACGGAA CCGAATTGAC ATTTCGACGT ACAGGACCCC 

8C1 CTAAAATOGT ATAAAATCTC GA 
GATTTTAGCA TATTTTAGAC CT 



ft i 1 
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♦3 K N I FLL N I t T E E V RLV 

] 

1 AGGAAATCAA ATTAGGATAA GATTTOTATC TGAfGAATAT TTTCCTTCTG AACCTTCTAA A&GAXTATAC 
TCCTTTACra IAATCCTATT CTAAACATJUS ACTACTTATA AAAGGAAGAC CTQGAACATT G1CTCCTCCA TTCTAATAIG 

♦3SCTP R N ? SVS I R E £ L K R TDT X F W P GCL 

81 AGCTGGAGAC CTCOIAACCT CTCAGTGTCC ATAAGGGAAG AACTAAAGAG AACCCATACC AUT1CIGGC CAOGMGTCT 
TOGACGTGXG GASCAIIGAA CAOTCftCAGG TATTCCCTTC TTGATTCCIC TTGCCTATGG TAAAAGACCG GTCCAftCAGft 
-2 < — 

-3 L V X R C G G N C A CCL HtfCN E C C CVP S X V 

161 CCTGGTTAAA CGCWTGGTC- GGAACTQWC C«WOTC« OACAATTGCA ATGAATOTCA AIGTGTCCCA AGCAAAGTTA 
COACCAACTT GCGACACCAC CCTTCACACC GACAACAC-AG GTGTTAACCT TACTTACAGT TACACACGOT TCGTTTCAAT 
-2 

H E V L Q L ft P X I GVR GLHX SLT OVA 

*1 VSG DCTMHSPTWP 

j 

241 CTAAAAAATA CCACGAGGTC CTTCAGTTGA GACCAAAGAC CCSTGTCAGG GGATTGCACA AASCACTCAC CGACGTCCCC 
GATOTTTCAT GGTGCTCCAC GAAGOCAACT C W m W CCCACAGTCC CCTAACGTGT TTAGTCAGTG GCTGCACCGG 
-2 C 

+3LEHH BBC DCV CRGS TGG 



*2 V QRB HRR I A A S P P A A I* A 

J 

♦1 WST KRSV T V C AEG A Q E 0 S St I * * S SSC 

321 CTGGAGCACC ATGAGGAGTG TQACTCTGTG T9CAGAGGGA GCACAGGAGG ATACCCGCA2 CACCACCAGC A3CTCTIGCC 
GACCTCGTGG TACTCCTCAC ACTCACACAC ACGTCTCCCT CCTCTCCTCC TATCGGCGTA GTGGTCGTCO TCGAGAACGG 

♦2 $ S C A V $ W L I L C V X S I L Nl.SC t L Q 

■*1PBLC S A V ADSI R E R MAY I* H P 

------- — — — > 

401 CASA GC TOTG CAGTGCAGTG GCTGATttCTA TTASAGAACG XATGCGTTAT CTCCATCCTT AA7CTCAGTT GTTTGCTTCA 

GTCTCOACAC CTCACGTCAC CGACTAAGAt AATCTCTTGC ATACGCAATA CAOGTAGCAA TTAGAGTCAA CAAACGAAGT 

+2 GPP JTRI TSA F 
> 

481 AGGACCTTTC ATCTTCAGGA TTTACAGTOC ATOTGAAAG AGGAGACATC AAACAGAATT AGGAGTTGTG GAACAGCTCX 
TCCTGGAAAG TAGAAGTCCT AAA03TCACG TAAGACTTCC TCCTCTGTAG TT7GTCTTAA TCCTCAACAC GTTGTCGAGA 

561 • TTTCAGAGGA GGCCTAAAGC ACAFGAGAAA AGOTCTTCAA TCGTGGAAAG AAAATTAAAI GTTGTATTAA A?AGA7CACC 
AAACTCTCCT CCGGATTTCC TSTCCTCWr TCCAGAAGTT ACCACCTTTC TTTTAATTTA CAACATAA7T TATCTAGTGG 

641 AGOTAGTTTC AGACTTACCA ?GTACGTATT CCACTAGCTG GGTTCTCXAT TTCAGTTCTT TCGATACGGC f TACCGtfAAT 
TGGATCAAAS TCTCAATC-GT ACATGCATAA CCTGATCGAC CCAAGACATA AAGKAA6AA AGCTATGCCG AATCCCATTA 

721 GTCAGTACAG CAAAAAAACT GIGCAAGTGA GCACCTGATT CCGTTGCCTT GGCTTAACTC TAAACCTCCA TGTCCTOGGC 
CAGTCATGTC CTTTTTTTGA CACGTTCACT CGTGGACTAA GGCAACOGAA CCGAATTGAG ATTTCGAGCT ACAGGACCCG 

801 CTAAAATCG? ATAAAATC7<3 GATTTmiN TTTTTTTTTG CGCATATTCA CATATGTAAA CCAGAACATT CTA3GTACTA 
GATTTTAGCA TATTTTAGAC CTAAAAAAAH AAAAAAAAAC GCCTATAAGT GTATACATTT SGTCTTGTAA GATACATGAT 

881 CAAACCTGGT TTTTAAAAAG GAACTATGTT GCTA7GAATT AAACTTGTGT CGTGCTGATA GGACAGACTG GATT7TTCAT 
CTTTGGACCA AAAATTTTTC C7TGATACAA CGATACTTAA TTTGAACACA GCACGACTAT CCTGTCTGAC CTAAAAAGTA 
-3 <• — 
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9*1 A l II VII A TT AAAATX3CXG CCATtTAOAA OAAOAGAACt ACATTCXTGO OTCCAAfcAO AXAAACCTGX" AAAGAAcaqt 
TAAAfiAATAA TJTTAAXGAC CCTAAATCTT CTTCTCITCA 1OTAACTACC AAACenCTC TATTTGCACT TTOCTTCTCA 

1041 GGCCTTATCT TCACWIATC CATAAOtCAG WTATOCTT TCATTCTGTA CA3TTTTATA TTCTCCTTTT GACATTATAA 
CCOOWLTACA ACftJUATAC CTATTCAOTC AAATAAACAA ACTAACACA7 GTAAAAATAT AAGAGGAAAA CTGTAATAIT 
-3 — — * 

X121 CTGXTOGCW TTCTAATCTr CTTAAAXAEA XCTATOTXA CCAAAGGtAr TTAATATTCT WTWAWM AACTTA6WC 
QACAACCOAA AAOATTAGAA CAATTTATAT ACATAAAAAT CCTPTCCATA AATTATAAGA AAAAATACTG TTGAATCTAC- 

1201 AACtATTTOT AGCTTOGTAA ATTTTTCTAA ACACAATtGT TATAGCCACA GGAACAAAGA TOATAIAAAA TATTGT7GCT 
TTGATAAAAA TOGAACCATT TAAAAACAXT TOTGTTAACA ATATC3GTCT CCTTOTTTCT ACTAXATTTX ATAACAACGA 

1281 CTGACAAAAA TACATGTAT7 TCA1TCICGT AtQCJTGCTAG AGfTAfiATIA ATCT3CATTT TAAAAAACTG AAfXGGAATA 
GA C T O TTTTT ATGTACATAA AGTAAGAQCft. TACCACCATC TCAATCTAAT TAGACSTAAA ATTTTTTGAC TTAACCT7AT 

13 SI GAATTGCTAA GTTGCAAAGA CTTTTTGAAA ATAAXTAAAT TATCASATCT 1CCATTCCTG TTATTGGAGA TGAAAASAAA 
CTTAACCATT CAACGTXTCT GAAAAACTTT TATTAATTTA ATAGTATAGA AGGXAACGAC AATAACCTCT ACTTTTATTT 

1441 AAGCAACTOA fGAAAGTAGA CATTCAGATC CACCCATTAC TAACCTATTC CCTTTTWOQ GAAATCTGAC CCTAGCTCAC 
WCGWGAAT ACTTTCAICT GTAACTCTAC CSCCGTAATG ATXGCATAAC GAAAAAACCC CTTTAGACTC GGATCGAGTC 

1S21 AAAAACATAA A2CACCTCCA AAAAGACTTC GCASCTTCCT GATAAAGCCT CCTOTCCtOT GCAGTAGGAA CACATCCTAT 
TTTTWTATT TCGTGGAACT TTOTCTGAAC CGTCGAAGCA CTATTTCGCA CGACACGACA QSTCATCCTT GTGTAQ3AXA 

1601 rSATTGXGAT GVXSGGZTT TATTA1CTTA AACI CTG TTC CATACACTT3 XATAAATACA TQCATATTTT XATGTACAGA 
AATAACACTA CAAC&CCAAA ATAATAGAAT CTTQAOACAAG CTAXOTCAAC ATATTTA1GT ACCTATAAAA ATACATGTCT 

A&XATGTCTC TTAACCAC7T CACTTATTGT ACCTOG 
TCATACAGAC AATTGQTCAA GTGAATAACA TGGACC 
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Figure 19* DNA and polypeptide sequence used for mammalian cell expression 



+1 mslfgllllcsalagqr 
a GGATCCAAAA TGAGCCTCTT CGGGCTTCTC CTGCTGACAT CTGCCCTGGC CGGCCAGAGA 

+1 q 9 t q a E S KLS SKFQ FSS NKE 
61 CAGGGGACTC AGGCGGAATC CAACCTGAGT AGTAAATTCC AGTTTTCCAG CAACAAGGAA 

+ 1QNGV QDP QKE RIIT VST N G S 
121 CAGAACGGAG TACAAGATCC TCAGCATGAG AGAATTATTA CTGTGTCTAC TAATGGAAGT 

♦ 1IHSP R F P H T Y P R N T V L V W R L 
181 ATTCACAGCC CAAGGTTTCC TCATACTTAT CCAAGAAATA CGGTCTTGGT ATGGAGATTA 

+1VAVE E N V tflQ LTFD EH? GLE 
241 GTAGCAGTAG AGGAAAATGT ATGGATACAA CTTACGTTTG ATGAAAGATT TGGGCTTGAA 

♦ 1DPED DIC K Y D FVEV EEP 5 <D G 
301 GACCCAGAAG ATGACATATG CAAGTATGAT TTTGTAGAAG TTGAGGAACC CAGTGATGGA 

♦1 T I L G RWC GSG TVPG X Q I SKG 
3 61 ACTATATTAG GGCGCTGGTG TGGTTCTGGT ACTGTACCAG GAAAACAGAT TTCTAAAGGA 

+ 1NQIR I R F VSD EYFP SEP GFC 
421 AATCAAATTA GGATAAGATT TGTATCTGAT GAATATTTTC CTTC7GAACC AGGGTTCTGC 

+1IHYN IVM P Q F TEAV SPS VLP 
481 ATCCACTACA ACATTGTCAT GCCACAATTC ACAGAAGCTG TGAGTCCTTC AGTGCTACCC 

+ 1 P S A L P L D L L N N A I T A F S TLB 
541 CCTTCAGCTT TGCCACTGGA CCTGCTTAAT AATGCTATAA CTGCCTTTAG TACCTTGGAA 

+ 1DLIR YLE PER W Q L D LED LYR 
601 GACCTTATTC GATATCTTGA ACCAGAGAGA TGG CAGTTGG ACTTAGAAGA TCTATATAGG 

+1PTWQ L L G KAF V F G R KSR VVD 
661 CCAACTTGGC AACTTCTTGG CAAGGCTTTT GTTTTTGGAA GAAAATCCAG AGTGGTGGAT 

+1LNLL TEE VRL YSCT PRN PSV 
721 CTGAACCTTC TAACAGAGGA GGTAAGATTA TACAGCTGCA CACCTCGTAA CTTCTCAGTG 

+ 1SIRE ELK RTD TIFW PGC LLV 
781 TCCATAAGGG AAGAACTAAA GAGAACCGAT ACCATTTTCT GGCCAGGTTG TCTCCTGGTT 

+1KRCG GNC ACC LKNC NEC QCV 
841 AAACGCTGTG GTGGGAACTG TGCCTGTTGT CTCCACAATT GCAATCAATG TCAATGTGTC 

+ 1PSKV TKK Y H E V L Q L RPK TGV 
901 CCAAGCAAAG TTACTAAAAA ATACCACGAG GTCCTTCAG7 TGAGACCAAA GACCGGTGTC 

+1 R G L H KSL TDV ALEH KEE CDC 
961 AGGGGATTGC ACAAATCACT CACCGACGTG GCCCTGGAGC ACCATGAGGA GTGTGACTGT 

+ 1VCRG STG G SR GPFS GKP IPN 
1021 GTGTGCAGAG GGAGCACAGG ACGATCTAGA GGCCCCTTCG AAGGTAAGCC TATCCCTAAC 

+ 1 PLLG LPS TRT CHHH KHH 
1051 CCTCTCCTCG GTCTCGATTC TACGCGTACC GGTCATCATC ACCATCACCA TTGA 
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Figure 20. DNA and polypeptide sequence used for baculovirus/insect cell expression 

1 GAATTCAAAG GCCTGTATTT TACTGTTTTC GTAACAGTTT TGTAATAAAA AAACCTATAA 

4.3 mlcf Ivn valv fmv v y i 3 y i 
61 ATATGAAATT CTTAGTCAAC GTTGCCCTTG TTTTTATGGT CGTATACATT TCTTACATCT 

+3 y a DP g S H HHHH H ES NLS S K F 
121 ATGCGGATCC GGAGTCTCAC CATCACCACC ATCATGAATC CAACCTGAGT AGTAAATTCC 

+3QFSS NKE QNGV QDP QHE RII 
181 AGTTTTCCAG C^ACAAGGAA CAGAACGGAG TACAAGATCC TCAGCATGAG AGAATTATTA 

+3TVST N G S IHSP R F P H T Y P R N 
241 CTGTGTCTAC 7AATGGAAGT ATTCACAGCC CAAGGTTTCC TCATACTTAT CCAAGAAATA 

+3TVLV *J R L VAVE E N V WIQ L T F 
301 CGGTCTTGGT A7GGAGATTA GTAGCAGTAG AGGAAAATGT ATGGATACAA CTTACGTTTG 

+3DERF GLE D P E E> DIG K Y D FVE 
361 ATGAAAGATT 7C-GGCTTGAA GACCCAGAAG ATGACATATG CAAGTATGAT TTTGTAGAAG 

+3 VEEP SDG TILG RWC GSG TVP 
421 TTGAGGAACC C-.GTGATGGA ACTATATTAG GGCGCTGGTG TGGTTCTGGT ACTGTACCAG 

♦3 G K Q I SKG NQIR IRF VSD E Y F 
481 GAAAACAGAT TTCTAAAGGA AATCAAATTA GGATAAGATT TGTATCTGAT GAATATTTTC 

+3PSEP GFC IHYN I V M PQF TEA 
54 1 CTTCTGAACC ASGGTTCTGC ATCCACTACA ACATTGTCAT GCCACAATTC ACAGAAGCTG 

+3VSPS VLP PSAL PLD LLN MAI 
601 TGAGTCCTTC A37GCTACCC CCTTCAGCTT TGCCACTGGA CCTGCTTAAT AATGCTATAA 

+3TAFS T L E DLIR YLE PER WQL 
661 CTGCCTTTAG 7ACCTTGGAA GACCTTATTC GATATCTTGA ACCAGAGAGA TGGCAGTTGG 

+3 D L E D L Y R PTWQ LLG KAF VFG 
721 ACTTAGAAGA TZTATATAGG CCAACTTGGC AACTTCTTGG CAAGGCTTTT GTTTTTGGAA 

+3RKSR VVD L N L L TEE VRL YSC 
781 GAAAATCCAG A3TGGTGGAT CTGAACCTTC TAACAGAGGA GGTAAGATTA TACAGCTGCA 

+3TPRN FSV SIRE ELK RTD TIF 
841 CACCTCGTAA C77CTCAGTG TCCATAAGGG AAGAACTAAA GAGAACCGAT ACCATTTTCT 

♦3WPGC LLV KRCG GNC ACC LHN 
901 GGCCAGGTTG 7C7CCTGGTT AAACGCTGTG GTGGGAACTG TGCCTGTTGT CTCCACAATT 

+.3CNEC QCV PSKV TKK YHE VLQ 
961 GCAATGAATG 7CAATGTGTC CCAAGCAAAG TTACTAAAAA ATACCACGAG GTCCTTCAGT 

+3 L R P K 7 G V RGLH KSL T D V ALE 
1021 TGAGACCAAA GACCGCTGTC AGGCGATTGC ACAAATCACT CACCGACCTG GCCCTGGAGC 

+3HHEE CDC VCRG STG G 
1081 ACCATGAGGA GTDTGACTGT GTGTGCAGAG GGAGCACAGG AGGATAGCTC TAGA 
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Figure 21. DNA and polypeptide sequence used for Exoli expression 



+3 OTN SSS NNNN NNW Wtt N LGI 
1 CGCAGACTAA TTCGAGCTCG AACAACAACA ACAATAACAA TAACAACAAC CTCGGGATCG 

+3 E G R I S E F ESNL SSK F Q F SSN 
61 AGGGAAGGAT TTCAGAATTC GAATCCAACC TGAGTAGTAA ATTCCAGTTT TCCAGCAACA 

+3 X E Q N GVQ D P Q H E R I I T V STN 
121 AGGAACAGAA CGGAGTACAA GATCCTCAGC ATGAGAGAAT TATTACTGTG TCTACTAATG 

+3GSIH S P R F P H T Y P R NTV LVW 
181 GAAGTATTCA CAGCCCAAGG TTTCCTCATA CTTATCCAAG AAATACGGTC TTGGTATGGA 

+3RLVA V E E N V W I QLT FDE RFG 
241 GATTAGTAGC AGTAGAGGAA AATGTATGGA TACAACTTAC GTTTGATGAA AGATTTGGGC 

+3LEDP EDD ICKV DFV EVE EPS 
301 TTGAAGACCC AGAAGATGAC ATATGCAAGT ATGATTTTGT AGAAGTTGAG GAACCCAGTG 

+3DGTI LGR WCGS GTV P G K QIS 
361 ATGGAACTAT ATTAGGGCGC TGGTGTGGTT CTGGTACTGT ACCAGGAAAA CAGATTTCTA 

+3KGNQ IRI RFVS D E Y FPS EPG 
421 AAGGAAATCA AATTAGGATA AGATTTGTAT CTGATGAATA TTTTCCTTCT GAACCAGGGT 

+3FCIH Y N I VMPQ FTE AVS P S V 
4 SI XCTGCATCCA CTACAACATT GTCATGCCAC AATTCACAGA AGCTGTGAGT CCTTCAGTGC 

+3LPPS ALP LDLL NNA I T A FST 
541 TACCCCCTTC AGCTTTGCCA CTGGACCTGC TTAATAATGC TATAACTGCC TTTAGTACCT 

+3 L E D L I R Y LEPE RWQ LDL E D L 
€01 TGGAAGACCT TATTCGATAT CTTGAACCAG AGAGATGGCA GTTGGACTTA GAAGATCTAT 

+3 Y R P T W Q L LGKA FVF GRX SRV 
661 ATAGGCCAAC TTGGCAACTT CTTGGCAAGG CTTTTGTTTT TGGAAGAAAA TCCAGAGTGG 

+3VDLN LLT EEVR LYS CTP RNF 
721 TGGATCTGAA CCTTCTAACA GAGGAGGTAA GATTATACAG. CTGCACACCT CGTAACTTCT 

+3SVSI REE LKRT DTI FWP GCL 
731 CAGTGTCCAT AAGGGAAGAA CTAAAGAGAA CCGATACCAT TTTCTGGCCA GGT1GTCTCC 

+ 3LVKR CGG NCAC CLH NCN ECQ 
541 TGGTTAAACG CTGTGGTGGG AACTGTGCCT GTTGTCTCCA CAATTGCAAT GAATGTCAAT 

+3CVPS KVT KKYH E V L QLR PKT 
901 GTGTCCCAAG CAAAGTTACT AAAAAATACC ACGAGGTCCT TCAGTTGAGA CCAAAGACCG 

♦3 G V R G LHK SXiTD VAL EKK EEC 
S61 GTGTCAGGGG ATTGCACAAA TCACTCACCG ACGTGGCCCT GGAGCACCAT GAGGAGTGTG 

+3DCVC RGS TGG H H H H H K * 
1021 ACTGTGTGTG CAGAGGGAGC ACAGGAGGAC ATCATCACCA TCACCATTGA TCTAGAGTCG 

10B1 ACCTGCAGGC AAGCTT 
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Figure 22, Disulphide-linked dimerisation of VEGF-X 




(B) E.coli expression 
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Figure 23. Glycosylate of VEGF-X 
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Figure 24. DNA and polypeptide sequence used for ELccli expression of the PDGF-like domain 



+3 K ft GSHH H HK HGW ASM 

1 AAGGAGATAT ACATATGCGG GGTTCTCATC ATCATCATCA TCATGGTATG GCTAGCATGA 

+3 TGGO nMG RDLY DDD D K „D £ G R 

€1 CTGGTGGACA GCAAATGGGT CGGGATCTGT ACGACGATGA CGATAAGGAT CCGGGAAGAA 

+3 K S R V VDL NLLT EEV R L Y SCT 
121 AATCCAGAGT GGTGGATCTG AACCTTCTAA CAGAGGAGGT AAGATTATAC AGCTGCACAC 

+3 P R N F SVS IRE E LKR TDT IFW 
181 CTCGTAACTT CTCAGTGTCC ATAAGGGAAG AACTAAAGAG AACCGATACC ATTTTCTGGC 

+3PGCL LVK RCGG NCA CCL HNC 
241 CAGGTTGTCT CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA 

t3NBCQ CVP SKV T K K Y HEV LQL 
301 ATGAATGTCA ATGTGTCCCA AGCAAAGTTA CTAAAAAATA CCACGAGGTC CTTCAGTTGA 

43RP2CT GVR GLHK SLT DVA LEH 
361 GACCAAAGAC CGGTGTCAGG GGATTGCACA AATCACTCAC CGACGTGGCC CTGGAGCACC 

+3 H E £ C D C V ORGS TGG 
421 ATGAGGAGTG TGACTGTGTG TGCAGAGGGA GCACAGGAGG ATAATGAATT CGAAGCTTGA 

481 TCCGGCTGCT AACAAAGCCC 
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Figure 25. Expression of PDGF domain in &coli 
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Figure 26. DNA and polypeptide sequence used for Exoli expression of the CUB-like domain 



+2 MA MDIG INS D P E S H H H HHH 

1 GGCGATGGCC ATGGATATCG GAATTAATTC GGATCCGGAG TCTCACCATC ACCACCATCA 

+2 ESN L S S K F Q F S S N K E Q N G V Q 
61 TGAATCCAAC CTGAGTAGTA AATTCCAGTT TTCCAGCAAC AAGGAACAGA ACGGAGTACA 

4-2 D P Q H E R I I T V S T N GSIH 5PR 
121 AGATCCTCAG CATGAGAGAA TTATTACTGT GTCTACTAAT GGAAGTATTC ACAGCCCAAG 

+2 FPH T Y P R NTV L V W RLVA VEE 
181 GTTTCCTCAT ACTTATCCAA GAAATACGGT CTTGGTATGG AGATTAGTAG CAGTAGAGGA 

+2 NVW I Q h T FDE R F G LEDP EDD 
241 AAATGTATGG ATACAACTTA CGTTTGATGA AAGATTTGGG CTTGAAGACC CAGAAGATGA 

+2 ICK Y D F V EVE EPS DGTI LGR 
301 CATATGCAAG TATGATTTTG TAGAAGTTGA GGAACCCAGT GATGGAACTA TATTAGGGCG 

+2 W C G SGTV P G K QZS KGKQ IRI 
361 CTGGTGTGGT TCTGGTACTG TACCAGGAAA ACAGATTTCT AAAGGAAATC AAATTAGGAT 

+ 2 RFV S D E Y FPS EPG FCIH Y N I 
421 AAGATTTGTA TCTGATGAAT ATTTTCCTTC TGAACCAGGG TTCTGCATCC ACTACAACAT 

♦2 V M P QFTE AV 
431 TGTCATGCCA CAATTCACAG AAGCTGTGTA GTCGAGCTCC GTCGACAAGC TTGCGGCGGC 

541 ACTCGAGCAC 
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Figure 27. Expression of the CUB domain in E.coli 



M CUB 



25K 
20K 




Express Mail Label No. 

EL024661737US 

Date of Deposit: 12/21/99 



Figure 2S. The Effect of Truncated VEGF-X (CUB domain) on HUVEC Proliferation 
(A) 



Human Umbilical Vein Endothelial Cells (one-day- 
treatment) 



3500 
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Figure 28 Continued 
(B) 



Human Umbilical Vein Endothelial Cells (24-hour- 
starving followed by one-day-treatment) 
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Figure 28 

(Q- The effect of VEGF-A^ and VEGF-X CUB domain on the proliferation of HUVEC (ftiro-day- 
treatment). 



3000 
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Figure 29 Tissue distribution of mRNA 
(A) -Normal tissues 
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Figure 30. Partial intron/exon structure of the VEGF-X gene 



(A) - Genomic DNA sequences of 2 exons determined by sequencing 



t c t C t C 1 1 a t ac ca ta t agcgg toga t c tgaaccagGGTTCTGCATCCACTACAACATTGTCATGCCACAATTCACAGAAGCTGTG 

AGTCCTTCAGTGCTACCCCCTTCAGCTTTGCCACTGG/^ 

TCGATATCTTGAACCAGAGAGATGGCAGTTGGACTTA^ 

TTGGAAGAAAAT CCAGAG TG GTGGATCTGAACC TT CT AACAGAGGAGGTAAGAT T AT ACAG CTGCACACCTCGTAACTTCTCAG TG 
TCCATAAGGGAAGAACTAAAOkGA^CCGATACCATTTT 

TTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACCACGA^ tagg t a C acaa ttttcttttt 
ggcccccttcgggtattctacgtcec 

aaagccagccatagacattcgtcgacccccaaaagtggcttactcctactccccttcagGTCCTTCAGTTGAGACCAAAGACCGGT 
GTCAGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAG 

ATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTT 
CCTTAATCTWGTTGTTTGCTTCAAGGACCTTTCM 

GAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGT 

AAATAGATCACCAGCT AGTTTCAGAGTTAC CATGTACGTAT TCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAG 

GGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATT^ 

CTAAAATCGTATAAAATCTGGATTTTTTTTTTTTTTTTTGCGCATATTCACATA^ 

TGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTT^^ 

AATTTCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTGGAAGAGATAAAC 

TCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTCT 

AATATATCTATTTTTACCAAAGGTATTTAATATTCTTTTTTATGACAAOT 

ACACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGT^ 

CTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTA 

TCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTT^^ 

TCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCT^ 

TGCTGTGCAGTAGGAACACATCCTATTTATTGTGATGXTGTGGT^ 

TGGATATTTTTATGTACAGAAGTATGTCTCTTAACCAGTTCACTTATTGTACTCTC^ 

TGCTTGTAAAATGCTTAATATCGTGCCTAGGTTATGTGGTGACTAT^ 

GGCTATTTTGGGGAGAAAATTa t g tg tgtg tg cgc ccaaga C t ca 1 1 c c t cgga c t cc gagaaaa tgaaag ac aaa 
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Figure 30 continued 

(B) - Location of splice sites within the cDNA sequence 



1 GAATTCGCCC TTTTGTTTM ACCTTGGGAA CTGGTTCAGG TCCAGGTTTT GCTTTGATCC 
€1 TTTTCAAAAA CTGGAGACAC AGAAGAGGGC TCTAGGAAAA AGTTTTGGAT GGGATTATGT 
121 GGAAACTACC CTGCGATTCT CTGCTGCCAG AGCAGGCTCG GCGCTTCCAC CCCAGTGCAG 

181 CCTTCCCCTG GCGGTGGTGA aagagactcg ggagtcgctg cttccaaagt gcccgccgtg * 

+ j MShFGhLLLTS 
241 AGTGAGCTCT CACCCCAGTC AGCCAAATGA GCCTCTTCGG GCTTCTCCTG CTGACATCTG 

♦3 A L A C 0* 0 C T 0 A ESN L S S K F 0 
301 CCCTGGCCGG CCAGAGACAG GGGACTCAGG CGGAATCCAA CCTGAGTAGT AAATTCCAGT 

+3 F S S It K E 0 N G V Q D P Q HER I 1 T 
362. TTTCCAGCAA CAAGGAACAG AACGGAGTAC AAGATCCTCA GCATGAGAGA ATTATTACTG 

+3 V S T N G S I H S P R F P H T Y P R W T 
421 TGTCTACTAA TGGAAGTATT CACAGCCCAA GGTTTCCTCA TACTTATCCA AGAAATACGG 

+3 V X V W R L V A V E E N V W X 0 I T F D 
481 TCTTGGTATG GAGATTAGTA GCAGTAGAGG AAAATGTATG GATACAACTT ACGTTTGATG 

+3 B R F G LED P E D D I C K Y D F V S V 
541 AAAGATTTGG GCTTGAAGAC CCAGAAGATG ACATATGCAA GTATGATTTT GTAGAAGTTG 

+3 E E P S DOT I L G R W C G $ G T V P G 
€01 AGGAACCCAG TGATGGAACT ATATTAGGGC GCTGGTGTGG TTCTGGTACT GTACCAGGAA 

+3 K 0 I S K G N 0 X R 1 R F V S D E Y F P 
6€1 AACAGATTTC TAAAGGAAAT CAAATTAGGA TAAGATTTGT ATCTGATGAA TATTTTCCTT 

+3 S E P |g PCI HYNI VMP .QFT E A V 
721 CTGAACCAGG GTTCtGCATC CACTACAACA TTGTCATGCC ACAATTCACA GAAGCTGTGA 

+3SPSV LPP SALP LDL LNN AIT 
781 GTCCTTCAGT GCTACCCCCT TCAGCTTTGC CACTGGACCT GCTTAATAAT GCTATAACTG 

+3AFST LSD I, I R Y L E P E R W QLD 
841 CCTTTAGTAC CTTGGAAGAC CTTATTCCAT ATCTTGAACC AGAGAGATGG CAGTTGGACT 

+3LEDL YRP TWQL LGK A F V PGR 
901 TAGAAGATCT ATATAGGCCA ACTTGGCAAC TTCTTGGCAA GGCTTTTGTT TTTGGAAGAA 

+3KSRV VDL NLLT E A N R L Y S C T 
9S1 AATCCAGAGT GGTGGATCTG AACCTTCTAA CAGAGGA^ST AAGATTATAC AGCTGCACAC 

+3PRNF SVS IREE LKR TDT IFW 
1021 CTCGTAACTT CTCAGTGTCC ATAAGGGAAG AACTAAAGAG AACCGATACC ATTTTCTGGC 

+3PGCL LVK RCGG NCA CCL KNC 
1081 CAGGTTGTCT CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA 

+ 3NECQ CVP SKVT KKY H E I V 1> Q h 
1141 ATGAATGTCA ATGTGTCCCA AGCAAAGTTA CTAAAAAATA CCACGAG|3TC CTTCAGTTGA 
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+3RPKT GVR GLHK SLT DVA LEH 
1201 GACCAAAGAC CGGTGTCAGG GGATTGCACA AATCACTCAC CGACGTGGCC CTGGAGCACC 

+ 3 H E E C DCV CRGS TGG 
1261 ATGAGGAGTG TGACTGTGTG TGCAGAGGGA GCACAGGAGG ATAGCCGCAT CACCACCAGC 

1321 AGCTCTTGCC CAGAGCTGTG CAGTGCAGTG GCTGATTCTA TTAGAGAACG TATGCGTTAT 

13 81 CTCCATCCTT AATCTCAGTT GTTTGCTTCA AGGACCTTTC ATCTTCAGGA TTTACAGTGC 

1441 ATTCTGAAAG AGGAGACATC AAACAGAATT AGGAGTTGTG CAACAGCTCT TTTGAGAGGA 

1501 GGCCTAAAGG ACAGGAGAAA AGGTCTTCAA TCGTGGAAAG AAAATTAAAT GTTGTATTAA 

1561 ATAGATCACC AGCTAGTTTC AGAGTTACCA TGTACGTATT CCACTAGCTG GGTTCTGTAT 

1621 TTCAGTTCTT TCGATACGGC TTAGGGTAAT GTCAGTACAG GAAAAAAACT GTGCAAGTGA 

1681 GCACCTGATT CCGTTGCCTT GCTTAACTCT AAAGCTCCAT GTCCTGGGCC TAAAATCGTA 

1741 TAAAATCTGG ATTTTTTTTT TTTTTTTTTG CTCATATTCA CATATGTAAA CCAGAACATT 

1801 CTATGTACTA CAAACCTGGT TTTTAAAAAG GAACTATGTT GCTATGAATT AAACTTGTGT 

1861 CATGCTGATA GGACAGACTG GATTTTTCAT ATTTCTTATT AAAATTTCTG CCATTTAGAA 

1921 GAAGAGAACT ACATTCATGG TTTGGAAGAG ATAAACCTGA AAAGAAGAGT GGCCTTATCT 

1931 TCACTTTATC GATAAGTCAG TTTATTTGTT TCATTGTGTA CATTTTTATA TTCtCCXTTT 

2041 GACATTATAA CTGTTGGCTT TTCTAATCTT GTTAAATATA TCTATTTTTA CCAAAGGTAT 

2101 TTAATATTCT TTTTTATGAC AACTTAGATC AACTATTTTT AGCTTGGTAA ATTrTTCTAA 

2161 ACACAATTGT TATAGCCAGA GGAACAAAGA TGATATAAAA TATTGTTGCT CTGACAAAAA 

2221 TACATGTATT TCATTCTCGT ATGGTGCTAG AGTTAGATTA ATCTGCATTT TAAAAAACTG 

2281 AATTGGAATA GAATTGGTAA GTTGCAAAGA CTTTTTGAAA ATAATTAAAT TATCATATCT 

2341 TCCATTCCTG TTATTGGAGA TGAAAATAAA AAGCAACTTA TGAAAGTAGA CATTCAGATC 

2401 CAGCCATTAC TAACCTATTC CTTTTTTGGG GAAATCTGAG CCTAGCTCAG AAAAACATAA 

2461 AGCACCTTGA AAAAGACTTG CCAGCTTCCT GATAAAGCGT GCTGTGCTGT GCAGTAGGAA 

252 1 CACATCCTAT TTATTGTGAT GTTGTGGTTT TATTATCTTA AACTCTGTTC CATACACTTG 

2581 TATAAATACA TGGATATTTT TATGTACAGA AGTATGTCTC TTAACCAGTT CACTTATTGT 

2641 ACCTGGAAGG GCGAATTCTG CAGATATC 
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